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I wish to dedicate this book to honor Professor Arnold L. Demain's 
60 years experience as a pioneer and a mentor in the field of natural 
product-based drug discovery. In 1954, he received his PhD from the 
University of California, Davis and Berkeley, in Microbiology, and 
joined Merck and Co. as a research microbiologist. By 1965, he had 
become the Founder and Head of the Department of Fermentation 
Microbiology at Merck. In 1969, he became a full Professor at MIT. 
He was elected to the National Academy of Sciences in 1994. Amy is 
one of the world's leading industrial microbiologists and a pioneer in 
research on the elucidation and regulation of the biosynthetic path- 
ways leading to penicillins and cephalosporins. He has led the way to 
the development of the (3-lactam industry. His current interests are in 
the area of industrial microbiology and biotechnology, including industrial fermentation, 
antibiotics, enzymes, secondary metabolism, biofuels, and bioconversions. During his ten- 
ure, Amy trained a group of visiting scholars, postdocs, and students from all over the world, 
which is now internationally renowned as "Amy's Army." Approximately every 2 years, 
there is a unique scientific symposium, bringing together key academic and industrial pro- 
fessionals in industrial microbiology and biotechnology, called "A Celebration of Amy's 
Army & Friends." Continuing the success of the four previous meetings (in 1995 in Cam- 
bridge, Massachusetts; in 1997 in Nara, Japan; in 1999 in Gent, Belgium; and in 2001 in 
Merida, Mexico), the fifth symposium will be held in Shanghai, China on June 27-29, 2005. 

Amy is a tireless advocate who would use every possible opportunity to promote natural 
product-based drug discovery. His vision, inspiration, and leadership contributed signifi- 
cantly to the soon-to-come renaissance of natural products. As we reflect on the history, it is 
abundantly clear that we benefit from his wisdom to this day. 
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Preface 



It seems appropriate to emphasize the topic of natural products at a time when new com- 
pounds are desperately needed to combat the current problems of antibiotic resistance, emer- 
gence of new diseases, continued presence of old, unconquered diseases, and the toxicity of 
certain present-day medical products. Despite such needs, today's output from the pharmaceu- 
tical industry has decreased markedly as a result of mega-mergers among the large pharmaceu- 
tical companies, and the downgrading of natural-product discovery efforts in favor of high 
throughput screening of synthetic compounds made by combinatorial chemistry. The latter may 
appear surprising because at least half of the antibiotics and antitumor agents approved by the 
FDA have been natural products, derivatives of natural products, or synthetic compounds 
inspired by natural product chemistry. However, it is a matter of economics. The extremely 
high costs to the large companies of purchasing or developing genomics, proteomics, and bio- 
informatics have left little funding available for the more tedious screening of natural products. 
Even so, there is some hope. The continuing success of biopharmaceutical products from the 
biotechnology industry points to the ever-increasing success of natural compounds, albeit that 
of large molecules. Some of these smaller companies are directing part of their efforts toward 
small-molecule natural-product screening. A few are emphasizing biodiversity by either har- 
nessing environmental DNA in the metagenomic effort or discovering means of growing the 
uncultured microbes of the past and learning how to induce secondary metabolism in these 
organisms. Other companies are emphasizing combinatorial biosynthesis to yield new deriva- 
tives or DNA shuffling to rapidly increase the levels of production. Future success is not a 
matter of the old vs the new; it is dependent on learning how to apply the exciting methodolo- 
gies of genomics, proteomics, combinatorial chemistry, DNA shuffling, combinatorial biosyn- 
thesis, biodiversity, bioinformatics, and high-throughput screening to rapidly evaluate the 
activities in extracts as well as purified components derived from microbes, plants, and marine 
organisms. 

There have been concomitant advances and an explosion of information in the field of natu- 
ral products and it is therefore timely to review both basic and applied aspects. Natural Prod- 
ucts: Drug Discovery and Therapeutic Medicine addresses historical aspects of natural products 
and the integration of approaches to their discovery, microbial diversity, specific groups of 
products (Chinese herbal drugs, antitumor drugs from microbes and plants, terpenoids, and 
arsenic compounds), specific sources (the sea, rainforest endophytes, and Ecuadorian 
biodiversity), and methodology (high-performance liquid chromatography profiling, combina- 
torial biosynthesis, genomics, bioinformatics, and strain improvement by modern genetic ma- 
nipulations). We consider past successes, the excitement of the present, and our thoughts on the 
future. We hope that this book will inspire industrial and academic researchers, practitioners, 
and developers, as well as administrators, to look again at Nature for the future gifts that will 
solve unmet medical needs and make the world a safer place in which to live. 

Lixin Zhang, PhD 
Arnold L. Demain, PhD 
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Part I 

Fundamental Issues Related to 
Natural Product-Based Drug Delivery 



1 

Natural Products and Drug Discovery 



Arnold L. Demain and Lixin Zhang 



Summary 

For more than 50 yr, natural products have served us well in combating infectious bacteria and 
fungi. During the 20th century, microbial and plant secondary metabolites helped to double our life 
span, reduced pain and suffering, and revolutionized medicine. The increased development of resis- 
tance to older antibacterial, antifungal, and antitumor drugs has been challenged by ( 1 ) newly discov- 
ered antibiotics (e.g., candins, epothilones); (2) new semisynthetic versions of old antibiotics (e.g., 
ketolides, glycylcyclines); (3) older underutilized antibiotics (e.g., teicoplainin); and (4) new deriva- 
tives of previously undeveloped narrow-spectrum antibiotics (e.g., streptogramins). In addition, many 
antibiotics are used commercially, or are potentially useful in medicine for purposes other than their 
antimicrobial action. They are used as antitumor agents, enzyme inhibitors including powerful 
hypocholesterolemic agents, immunosuppressive agents, antimigraine agents, and so on. A number 
of these products were first discovered as antibiotics that failed in their development as such, or as 
mycotoxins. 

It is unfortunate that the pharmaceutical industry has downgraded natural products just at the 
time that new assays are available and major improvements have been made in detection, character- 
ization, and purification of small molecules. With the advent of combinatorial biosynthesis, thou- 
sands of new des4oeatives can now be made by a biological technique complementary to combinatorial 
chemistry. Furthermore, only a minor proportion of bacteria and fungi, i.e., 0.1-5%, have thus far 
been examined for secondary metabolite production. New methods are being developed to cultivate 
the so-called unculturable microbes from the soil and the sea. High-throughput screening (HTS) of 
combinatorial chemicals has not provided the numbers of high-quality leads that were anticipated. It 
has virtually eliminated the most unique source of chemical diversity, i.e., natural products, from the 
playing field, in favor of combinatorial chemistry. Combinatorial chemistry mainly yields minor 
modifications of present-day drugs and absolutely requires new scaffolds on which to build. Although 
comparative genomics is capable of disclosing new targets for drugs, the number of targets is so large 
that it requires tremendous investments of time and money to set up all the screens necessary to exploit 
this resource. This can be handled only by HTS methodology, which demands libraries of millions 
of chemical entities. Although such targets would be excellent for screening natural products, the 
industry has failed to exploit this unique opportunity and has opted to save funds by eliminating 
natural-product departments or decreasing their relevance in the hunt for new drugs. It is clear that 
the future success of the pharmaceutical industry depends on the combining of complementary tech- 
nologies such as natural product discovery, HTS, integrative and systems biology, combinatorial 
biosynthesis, and combinatorial chemistry. 

Key Words: Antibiotics; antitumor; immunosuppressants; hypocholesterolemics; enzyme 
inhibitors; drug discovery; natural products; combinatorial biosynthesis. 
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1. Introduction 



Natural products have been an overwhelming success in our society (Fig. 1). They 
have reduced pain and suffering, and revolutionized medicine by facilitating the 
transplantation of organs. Natural products are the most important anticancer and anti- 
infective agents. More than 60% of approved and pre-new drug application (NDA) can- 
didates are either natural products or related to them, not including biologicals such as 
vaccines and monoclonal antibodies (1). 

Many natural products have reached the market without chemical modification, a 
testimony to the remarkable ability of microorganisms to produce small, drug-like mol- 
ecules. Indeed, the potential to commercialize a compound without chemical modifica- 
tion distinguishes natural products from all other sources of chemical diversity and 
fuels efforts to discover new compounds. Nature apparently optimizes certain com- 
pounds through many centuries of evolution. In these cases, production of the product 
directly by microbial fermentation is much more economical than using synthetic chem- 
istry, e.g., steroids, (3-lactams, erythromycin. In other cases, the natural molecule was 
not used itself but served as a lead molecule for manipulation by chemical or genetic 
means, e.g., cephalosporins, rifampicin. In these instances, the natural product pre- 
sented important structural motifs and pharmacophores, which were then optimized 
via "semi-synthesis" to yield drugs with improved properties. 

Secondary metabolism has evolved in nature in response to needs and challenges of the 
natural environment. Nature has been continually carrying out its own version of combina- 
torial chemistry (2) for the over 3 billion years during which bacteria have inhabited the 
earth (3). During that time, there has been an evolutionary process going on in which pro- 
ducers of secondary metabolites evolved according to their local environments. If the metab- 
olites were useful to the organism, the biosynthetic genes were retained, and genetic 
modifications further improved the process. Combinatorial chemistry practiced by nature is 
much more sophisticated than that in the laboratory, yielding exotic structures rich in stere- 
ochemistry, concatenated rings, and reactive functional groups (2). As a result, an amazing 
variety and number of products have been found in nature. The total number of natural 
products produced by plants has been estimated to be over 500,000 (4). One-hundred sixty- 
thousand natural products have been identified (5), a value growing by 10,000 per year (6). 
About 100,000 secondary metabolites of molecular weight less than 2500 have been char- 
acterized, half from microbes and the other half from plants (7-9). 

It is not generally appreciated that a number of synthetic products of wide medical 
use have a natural origin from microbial, plant, and even animal systems. The prede- 
cessor of aspirin has been known since the fifth century bc, at which time it was 
extracted from willow tree bark by Hippocrates. It probably was used even earlier in 
Egypt and Babylonia for fever, pain, and childbirth (10). Salicylic acid derivatives 
have been found in plants such as white willow, wintergreen, and meadowsweet. Syn- 
thetic salicylates were produced on a large scale in 1874 by the Bayer Company in 
Germany. In 1897, Arthur Eichengrun at Bayer discovered that its acetyl derivative 
was able to reduce its acidity, bad taste, and stomach irritation (11); thus was born 
aspirin, of which 50 billion tablets are consumed each year. The drugs Acyclovir 
(ZoviraxR) used against herpes virus and Cytarabine (Cytostar®) for non-Hodgkin's 
lymphoma were originally isolated from a sponge (12). Drugs inhibiting human immu- 
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Fig. 1. Structures of some important natural products not used as antibiotics. 



nodeficiency virus (HIV) reverse-transcriptase and protease were derived from natural 
product leads screened at the National Cancer Institute (13). Angiotensin-converting 
enzyme (ACE) inhibitors, widely used for hypertension and congestive heart failure, 
are chemicals based on peptides isolated from snake venom (14,15). 

In the last decade, some large pharmaceutical companies, emphasizing combinato- 
rial chemistry, left natural products and attempted to fill the void with large numbers of 
synthetic molecules. Unfortunately, the chemistry employed did not create sufficiently 
diverse or pharmacologically active molecules. Fortunately, some small biotechnology 
companies have revitalized the interest in natural products. Approaches such as diver- 
sity-oriented synthesis, which mimics the structures of natural products, are emerging 
for drug discovery (15a). 

Highly diverse and selective synthetic compounds resulting from these efforts could 
be useful for chemical genomics and chemical genetics, to uncover disease-relevant 
protein targets and to understand critical biological pathways (15b). Such information 
could push biology forward toward an understanding of the initiation and progression 
of diseases, and shed light on new therapeutic intervention. 
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More natural product research is needed due to: unmet medical needs; remarkable 
diversity of structures and activities; utility as biochemical probes; novel and sensitive 
assay methods; improvements in isolation, purification, and characterization; and new 
production methods (16). 

The enormous diversity of microorganisms is a factor that must be kept in mind for 
future drug development. Bacteria have existed on earth for over 3-billion years, and 
eukaryotes have been around for 1 billion years. Because 95-99.9% of organisms exist- 
ing in nature have not yet been cultured, only a minor proportion of bacteria and fungi 
have thus far been examined for secondary metabolite production. It has been estimated 
that 1 g of soil contains 1000 to 10,000 species of undiscovered prokaryotes (17). 

Estimates of the number of described fungal species vary from about 65,000 to 
250,000, but as many as 10 million might exist in nature (18). Their total weight is 
thought to be higher than that of humans. Of the fungal species that have been described, 
only about 16% have been cultured. The use of fungal ecology in the search for new 
drugs is extremely important. The estimated number of fungal species is more than five 
times the predicted number of plant species and fifty times the estimated number of 
bacterial species. Some previously unrecognized and uncultivated microbes can be iso- 
lated from the environment by encapsulating cells in gel microdroplets under low nutri- 
ent flux conditions and detecting microcolonies by flow cytometry (19). In addition to 
new ways of culturing microbes, accessing the diversity of "environmental DNA" (also 
called metagenomic DNA) is an exciting area of research (20). 

The concept that microbial strains must be isolated from different geographical and 
climatic locations around the world in order to insure diversity in collections still gath- 
ers support (21,22). 

2. Antibiotics for Human Therapy 

The selective action exerted on pathogenic bacteria and fungi by microbial second- 
ary metabolites ushered in the antibiotic era. For more than 50 yr, we have benefited 
from this remarkable property of wonder drugs such as penicillins, cephalosporins, 
tetracyclines, aminoglycosides, chloramphenicol, and macrolides. They have been cru- 
cial in the increase in average life expectancy in the United States from 47 yr in 1 900 to 
74 yr for males and 80 yr for women in 2000 (2,23). Antibiotics have been virtually the 
only drugs utilized for chemotherapy against pathogenic microorganisms. They are 
defined as low-molecular-weight organic natural products (secondary metabolites, or 
idiolites) made by microorganisms, which are active at a low concentration against 
other microorganisms. Of the 12,000 antibiotics known in 1995, 55% were produced 
by filamentous bacteria (actinomycetes) of the genus Streptomyces, 11% from other 
actinomycetes, 12% from nonfilamentous bacteria, and 22% from filamentous fungi 
(8,24). New bioactive products from microbes have been discovered at an amazing 
pace: 200 to 300 per year in the late 1970s and increasing to 500 per year by 1997. 
However, recently the number has dropped off as a result of the misguided loss of 
interest by some large pharmaceutical companies in the discovery of new antibiotics 
and even in natural products (discussed later). 

More than 350 agents have reached the world market as antimicrobials (anti- 
infectives [defined as antibiotics, both natural and semi-synthetic] plus strictly syn- 
thetic chemicals) (25). The antibiotics include cephalosporins (45%), penicillins (15%), 
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tetracyclines (6%), macrolides (5%), aminoglycosides, ansamycins, glycopeptides, and 
polyenes (24). The synthetics include quinolones (11%) and the azoles. Of the 25 top- 
selling drugs in 1997, 42% were natural products or derived from natural products 
(22); of these, antibiotics contributed 67% of sales. 

The worldwide market for antibiotics in 1996 was $24 billion (26) and today is 
about $35 billion. Antimicrobials, including antibiotics, synthetics, and antiviral agents, 
had sales of $55 billion in 2000. The market for cephalosporins was $9.9 billion (27); 
sales of penicillins amounted to $8.2 billion and that of other (3-lactams was $1.5 bil- 
lion, making a total of $19.6 billion for (3-lactam antibiotics. Markets for other groups 
were $6.4 billion for quinolones (synthetic antibacterials), $5.2 billion for macrolides 
(including $3.5 billion for erythromycins) (28), $4.2 billion for antifungals (including $2 
billion for the synthetic azoles) (29) and antiparasitics, $1.8 billion for aminoglycosides, 
and $1.4 billion for tetracyclines. The antiviral market was $10.2 billion (vaccines 
excluded). All other antimicrobials had a market of $ 6.1 billion. It is expected that the 
antimicrobial market will continue to grow as a result of (1) a worldwide aging popula- 
tion; (2) increasing numbers of immunocompromised patients, mainly infected with 
HIV, who often require longer courses of anti-infective treatment; and (3) increasing 
microbial resistance worldwide. 

Antimicrobials with markets over $1 billion include Augmentin ($2 billion); the 
quinolones ciprofloxacin (Cipro; $1.8 billion) and levofloxacin/ofloxacin (Levaquin, 
Floxin; $1.1 billion); the semi-synthetic erythromycins azithromycin (Zithromax; $1.5 
billion) and clarithromycin (Biaxin; $1.2 billion); and the semi-synthetic cephalosporin 
ceftriazone (Rocephin; $1.1 billion) (30). Augmentin is a combination of a semi-syn- 
thetic penicillin and an inhibitor of penicillinase (clavulanic acid). Combined sales of 
the glycopeptides vancomycin and teicoplanin were $1 billion per year (31). 

Unfortunately, in 1969, US Surgeon General William H. Stewart stated to Congress: 
"The time has come to close the book on infectious disease" (30). Companies began to 
exit from the antibiotic area. There was great difficulty and a high cost of isolating 
novel antibiotic structures and agents with new modes of action because the chance of 
finding useful antibiotics from microbes was very low. The following are experiences 
of workers in the field: 

1. Only one in 10,000 to 150,000 compounds made it into medical practice (32-34). 

2. Only 3 usable antibiotics were isolated from a 10-yr screen of 400,000 cultures of micro- 
organisms (32). 

3. Of 5000 compounds evaluated, 5 entered human clinical trials and of these, only 1 was 
approved by the US Food and Drug Administration (FDA) (35). 

4. Of 5000 to 10,000 compounds screened, 250 leads entered preclinical testing, 5 drug can- 
didates entered phase I, 4 passed into phase Ha, 1.5 passed phase lib, 1.2 passed phase III, 
and 1 drug was approved by FDA (36). 

Similar experiences were encountered with products from plants (37). Although 
chemists had been successfully improving natural antibiotics by the technique of semi- 
synthesis for many years, new screening techniques were sorely needed in 1970 to 
isolate new bioactive molecules from nature. A few companies downgraded their 
efforts in natural product discovery in favor of large investments in recombinant 
DNA technology. Indeed, the number of anti-infective investigational new drugs 
(INDs) declined by 50% from the 1960s to the late 1980s (38). Many felt that the 
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golden era of antibiotic discovery was over, but this was far from the truth. Owing 
mainly to the development of novel target-directed screening procedures by some for- 
ward-thinking companies, important new antibiotics appeared on the scene and became 
commercial successes in the 1970s and 1980s. These included cephamycins (e.g., 
cefoxitin) (39), fosfomycin (40), carbapenems (e.g., thienamycin) (41), monobactams 
(e.g., aztreonam), glycopeptides (e.g., vancomycin, teicoplanin), aminoglycosides (e.g., 
amikacin, sisomicin), as well as semisynthetic versions of cephalosporins and 
macrolides. 

Another factor limiting new antibiotic discovery is the shift by major pharmaceuti- 
cal companies to pursue larger drug markets such as depression, heartburn, and erectile 
dysfunction. This shift to lifestyle problems and chronic complaints is of major con- 
cern to infectious-disease physicians. The hard fact is that there is not much economic 
incentive to create new antibiotics. The market potential for a new antibiotic is esti- 
mated to be $200 million to $400 million in sales per year. In contrast, drugs to treat 
chronic conditions are often used by patients daily for the rest of their lives and are 
much more likely to reach blockbuster status (over $1 billion/yr). The FDA approved 
only two new antibiotics in 2003, and none was approved in 2002. Such numbers are 
only a fraction of the numbers of antibiotics approved in the 1980s and early 1990s. 
From 1983 to 1992, thirty new antibiotics won FDA approval. Over the next 10 years, 
17 were approved. Serious medical consequences could result from this situation. Many 
types of bacteria are developing resistance to existing classes of antibiotics (discussed 
later). Nosocomial infections (infections acquired in hospitals) pose a particular threat 
to ill patients. Governments must become involved in reversing the trend away from 
antibiotic research and development. 

Microbiologists have known for years that technology had not won the war against 
infectious microorganisms due to resistance development in pathogenic microbes. 
Indeed, technology will never win this war, and we must be satisfied to merely stay 
one step ahead of the pathogens for a long time to come (42). Thus, the search for new 
drugs must not be stopped. New antibiotics are continually needed because of the fol- 
lowing: 

1. Resistant pathogens are developing, e.g., enterococci resistant to all antibiotics (43). 

2. New diseases are evolving, e.g., acquired immunodeficiency syndrome (AIDS), Hanta 
virus, Ebola virus, Crypto spiridium, Legionnaire's disease, Lyme disease, Escherichia 
coli 0157:H7. The World Health Organization concluded that at least thirty new diseases 
emerged in the 1980s and 1990s (44). 

3. Naturally resistant bacteria exist, e.g., Pseudomonas aeruginosa, causing fatal wound infec- 
tions, burn infections, and chronic and fatal infections of lungs in cystic fibrosis patients; 
also Stenotrophomonas maltophilia, Enterococcus faecium, Burkholderia cepacia, and 
Acinetobacter baumanni (45). Furthermore, tuberculosis had never been defeated. 

4. Some of the compounds in use are relatively toxic (24). 

The resistant bacteria that have developed over the years are generally uninhibited 
by most commercial antibiotics (46). Enterococci resistant to all antibiotics have arisen 
(43). Other organisms exist that are not normally virulent but do infect immunocom- 
promised patients (47). In recent years, there has been great concern about resistance 
development among Gram-positive pathogens, the so-called methicillin-resistant bac- 
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teria. Clinical isolates of penicillin-resistant Streptococcus pneumoniae, the most com- 
mon cause of bacterial pneumonia, increased 60-fold in the United States from 1987 to 
1992 (48). Methicillin-resistant Staphylococcus aureus (MRSA) infections increased 
to an alarming extent (49). Recently, the glycopeptide vancomycin has been the mol- 
ecule of choice to treat infections caused by such organisms; however, vancomycin 
resistance is developing, especially in the case of nosocomial Enterococcus infections. 
Fortunately, some vancomycin-resistant enterococci (VRE) are treatable by the related 
glycopeptide antibiotic teicoplanin. Of the three different resistance mechanisms in 
different strains of VRE, two of these, Van B and Van D, are induced by vancomycin 
but not by teicoplanin (50,51); Van A is induced by either antibiotic. Teicoplanin also 
has fewer side effects than vancomycin, and a longer half-life in the body (52,53). A 
major problem today is tuberculosis, which is infecting 2 billion people worldwide. 
Each year, 9 million new cases are diagnosed and 2.6 million people die. Resistance is 
developing to the combination treatment of isoniaizid and rifamycin (53a). 

Exploitation of old and underutilized antibiotics is occurring via semi-synthesis of 
drugs active against resistant bacteria. One group of useful narrow-spectrum com- 
pounds are the streptogramins, which are synergistic pairs of antibiotics made by single 
microbial strains. The pairs are constituted by a (group A) polyunsaturated macrolactone 
containing an unusual oxazole ring and a dienylamide fragment, and a (group B) cyclic 
hexadepsipeptide possessing a 3-hydroxypicolinoyl exocyclic fragment. Such strepto- 
gramins include virginiamycin and pristinamycin (54). Pristinamycin, made by Strep- 
tomyces pristinaespiralis , is a mixture of a cyclodepsipeptide (pristinamycin I) and a 
polyunsaturated macrolactone (pristinamycin II). Although the natural streptogramins 
are poorly water-soluble and cannot be used intravenously, new derivatives have been 
made by semi-synthesis and mutational biosynthesis. Synercid (RP59500) is a mixture 
of two water-soluble semisynthetic streptogramins, quinupristin (RP57669) and 
dalfopristin (RP54476), developed by Rhone Poulenc-Rorer (now Aventis) (55), and 
approved in 1999 for resistant bacterial infections (56). Synercid is useful against (3- 
lactam-resistant S. pneumoniae (57). The two Synercid components synergistically 
(100-fold) inhibit protein synthesis and are active against VRE and MRSA (58,59). 
Synergistic action of the streptogramins is due to the fact that the B component blocks 
binding of aminoacyl-tRNA complexes to the ribosome while the A component inhib- 
its peptide bond formation and distorts the ribosome, promoting the binding of the B 
component (60). 

Semisynthetic tetracyclines, e.g., glycylcyclines, are being developed by Wyeth for 
use against tetracycline-resistant bacteria (61). The 9-?-butylglycylamido derivative of 
minocycline called GAR-936 is a novel glycylcycline (62). It is active in vitro and in 
vivo (with mice) against resistant Gram-positive, Gram-negative, and anaerobic bacte- 
ria possessing the ribosomal protection resistance mechanism or the active efflux 
mechanism. 

New vancomycin derivatives have also been made, which are active against vanco- 
mycin-resistant Gram-negative bacteria (63). These have modified carbohydrate moi- 
eties and are inhibitors of the transglycosidation reactions of peptidoglycan biosynthesis 
without binding to D-ala-D-ala, the traditional mode of action of vancomycin. They not 
only act on resistant bacteria but are more active than vancomycin on sensitive bacteria 
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(64). They inhibit the earlier transglycosylation step of cell-wall synthesis rather than 
the main vancomycin target, transpeptidation. Even the sugars themselves have anti- 
bacterial activity. An exciting molecule in the new antibiotic arena is the lipoglyco- 
depsipeptide ramoplanin, produced by Actinoplanes ATCC 33076 (65). Ramoplanin 
inhibits cell-wall synthesis in Gram-positive bacteria by a new mechanism: binding to 
lipid I and II intermediates at a site different from vancomycin's target ,/V-acyl-D-ala-D-ala 
dipeptide. By binding to the lipid intermediates, these substrates are physically pre- 
vented from being acted upon by the late peptidoglycan enzymes MurG and the 
transglycosidases (66,67). Ramoplanin is 2- to 10-fold more active than vancomycin 
and is active against MRSA, VRE, and pathogens resistant to ampicillin and erythro- 
mycin (68). 

Semisynthetic erythromycins have been very successful (69). Modified macrolides 
include clarithromycin (Biaxin of Taisho), azithromycin (Zithromax of Pliva), and the 
ketolide telithromycin (Ketek of Aventis). Whereas the first two showed improved 
acid stability and bioavailability over erythromycin A, they showed no improvement 
against resistant strains (70). On the other hand, the ketolides (third-generation semi- 
synthetic erythromycins, 6-methyl-3-oxoerythromycin derivatives, i.e., polyketides 
containing a 14-membered macrolide ring and a C-3 keto group in place of the C-3 
cladinose in erythromycin A) act against macrolide-sensitive and macrolide-resistant 
bacteria (71-73). All of the above semisynthetic erythromycins are effective agents for 
upper respiratory tract infections and can be given parenterally or orally. The ketolides 
include telithromycin (Ketek of Aventis; formerly called HMR-3647) (74,75), which 
has been approved, and ABT-773 (Abbott), which is in Phase II clinical development. 
ABT-773 has broad-spectrum activity, including anaerobes and intracellular patho- 
gens. It binds to its ribosomal target with 10- to 100-fold greater affinity than erythro- 
mycin A, has increased uptake and/or reduced efflux, and is bactericidal. Ketolides are 
also active against penicillin- and erythromycin-resistant S. pneumoniae and many 
other bacteria, such as Hemophilus influenzae, group A streptococci, Legionella spp., 
Chlamydia spp., and Mycoplasma pneumoniae (76). They are produced semi-syntheti- 
cally from erythromycin (77). Telithromycin is bacteriostatic, active orally, and of great 
importance for community-acquired respiratory infections. Of great interest is its low 
ability to select for resistance mutations and to induce cross-resistance. It does not 
induce MLS B resistance, a problem with other macrolides. 

A number of previously discovered compounds are now available for development 
that were not previously exploited because of their narrow antibacterial spectrum, 
which was restricted to Gram-positive bacteria. At that time (in the 1970s and 1980s), 
breadth of spectrum was the commercial goal, but today, an important aim is to inhibit 
resistant Gram-positive pathogens. An example is the lipopeptide daptomycin, pro- 
duced by Streptomyces roseosporus, which acts on Gram-positive bacteria, including 
VRE, MRSA, and penicillin-resistant S. pneumoniae (78). It kills by disrupting plasma 
membrane function without penetrating into the cytoplasm. Daptomycin was discov- 
ered by Eli Lilly and Co. in the early 1980s and licensed to Cubist Pharmaceuticals, 
Inc., in 1997. It was approved in 2003. 

Efforts are proceeding to screen for or design compounds that interfere with resis- 
tance mechanisms (79). Such a strategy has long been successful with clavulanic acid, 
a natural (3-lactamase inhibitor. The plant natural product 5'-methoxyhydnocarpin has 
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been found to inhibit the NorA multidrug-resistance pump and to potentiate norfloxacin 
activity (80). A successful inhibitor of the multidrug-resistance pump could be of tre- 
mendous help in combating resistant bacteria. 

About 200 species of fungi are pathogenic to mammals. Most such infections are 
self-limiting, but they can be deadly to immunocompromised patients; e.g., systemic 
fungal infections are responsible for the deaths of 50% of leukemia patients. Fungal 
infections are a real problem today, having doubled from the 1980s to the 1990s, with 
bloodstream infections increasing fivefold, with an observed mortality of 55% (81). 
There is an increasing incidence of candidiasis, cryptococcosis, and aspergillosis, espe- 
cially in AIDS patients (82). Aspergillosis failure rates exceed 60%. Fungal infections 
occur often after transplant operations: 5% for kidney, 15-35% for heart and lung, up 
to 40% for liver transplants, usually (80%) by Candida and Aspergillus spp. (83). 
Pulmonary aspergillosis is the main factor involved in deaths of recipients of bone- 
marrow transplants, and Pneumocystis carinii is the number one cause of death in 
patients with AIDS from Europe and North America (84). 

The four classes of antifungal antibiotics in use today are the natural polyenes and 
the synthetic azoles, allylamines, and fluoropyrimidines (85). The first three classes 
target ergosterol, the major fungal sterol in the cell membrane. Polyene macrolides 
selectively bind ergosterol and destabilize fungal membranes, leading to leakage of 
cell components and subsequent cell death. Amphotericin B is a polyene macrolide 
that was introduced in 1956 and has been the standard for antifungal therapy, since it 
was the only effective agent against systemic fungal infection. Although highly active 
against a wide range of fungi, it must be administered intravenously and is somewhat 
toxic, producing a range of side effects. The azoles (e.g., fluconazole, itraconazole, and 
ketoconazole) interfere with the biosynthesis of sterols and other membrane lipids that 
comprise the fungal cell membrane. They do this by inhibiting cytochrome P450- 
dependent lanosterol 14-oc-demethylase, which is responsible for converting lanos- 
terol to ergosterol. The lack of ergosterol in the cell membrane leads to cell permeability 
and death. Each of the azoles has a different spectrum of effectiveness and defined 
limitations. For example, fluconazole is highly effective against Cryptococcus, a seri- 
ous infection common in AIDS patients, but is ineffective against Aspergillus and has 
limited effectiveness against certain Candida species. Itraconazole has the broadest 
range of activity and the fewest side effects among the azoles, but suffers from unpre- 
dictable bioavailability, varying between patients, and frequent drug interactions. 
Ketoconazole is the most effective azole against chronic, indolent forms of endemic 
fungal infections, but is associated with clinically important toxic effects, including 
hepatitis. Allylamines (e.g., terbinafine) inhibit squalene epoxidase, another enzyme 
leading to ergosterol. Fluoropyrimidines (e.g., 5-fluorocytosine) are pyrimidine ana- 
logs that are selectively converted by a fungal enzyme to its active nucleoside, which 
interferes with DNA synthesis in fungi. Their spectrum of activity is fairly limited, and 
drug resistance develops if they are used alone. For that reason, 5-fluorocytosine is 
utilized in combination with amphotericin B or fluconazole. Toxicity is frequent, how- 
ever, which includes mucositis and myelosuppression (85a). Usage of these antifungal 
compounds is becoming limited especially due to development of resistance to the 
azoles and toxicity of the polyenes. 
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The cyclic lipopeptides, known as candins (or echinocandins or pneumocandins), 
inhibit (l,3)-(3-glucan synthase and thus the biosynthesis of the 1,3 glucan layer of the 
Candida albicans cell wall; they are relatively nontoxic. Although the earlier discov- 
ered papulocandins failed due to a spectrum restricted to Candida and lack of in vivo 
activity, the parenteral semisynthetic caspofungin of Merck (also known as pneumo- 
candin, L-743,872, MIC 991, or Cancidas) produced by Zalerion arboricola inhibits 
the same enzyme and was approved in 2000 (86-88). It is active against many species 
of Candida, including C. albicans, many species of Aspergillus, and Histoplasma, and 
can be administered as an aerosol for prophylaxis against P. carinii, a major cause of 
death in HIV patients (29). It is more active and less toxic than amphotericin B. Another 
semisynthetic candin is micafungin (FK-463), which was recently approved in Japan. 
Lilly's anidulafungin (V-echinocandin, LY-303366A) produced by Aspergillus 
nidulans var. echinulatus (89), was licensed to Versicor (now Vicuron Pharmaceuti- 
cals) and is awaiting FDA approval. In contrast to the currently used azoles, which are 
fungistatic, candins are fungicidal. 

3. Novel Applications of Antibiotics 

An extremely important concept for the further development of natural products is 
that compounds that possess antibiotic activity also possess other activities. Some of 
these activities had been quietly exploited in the past, and it became clear in the 1980s 
that such a scope should be expanded. Thus, a broad screening of antibiotically active 
molecules for antagonistic activity against organisms other than microorganisms, as 
well as for activities useful for pharmacological or agricultural applications, was pro- 
posed by Umezawa (90,91) in order to yield new and useful lives for "failed antibiot- 
ics" (for a review, see ref. 92). This resulted in the development of a large number of 
simple in vitro laboratory tests (e.g., enzyme inhibition screens) to detect, isolate, and 
purify useful compounds. As a result, we entered a new era in which microbial metabo- 
lites were applied to diseases heretofore only treated with synthetic compounds, i.e., 
diseases not caused by bacteria and fungi, and huge successes were achieved. 

3. 1. Anticancer Drugs 

In 2000, 57% of all drugs in clinical trails for cancer were either natural products or 
their derivatives (93). The drug cytarabine (Cytostar) for non-Hodgkin's lymphoma 
was originally isolated from a sponge (12). Most of the important antitumor compounds 
used for chemotherapy of tumors are microbially produced antibiotics (94,95). These 
include actinomycin D, mitomycin, bleomycins, and the anthracyclines daunorubicin 
and doxorubicin. 

Metastatic testicular cancer, although rather uncommon (1% of male malignancies 
in the United States; 80,000 in the year 2000 as compared to 190,000 cases of prostate 
cancer), is the most common carcinoma in men aged 15 to 35 yr. It is of significance 
that the cure rate for disseminated testicular cancer was 5% in 1974, whereas today it is 
90%, mainly as a result of combination chemotherapy with the natural products 
bleomycin and etoposide and the synthetic cisplatin (96). 

The recent successful molecule Taxol (paclitaxel) was originally discovered in plants 
(97) and later reported to be a fungal metabolite (98). It was approved for breast and 
ovarian cancer, and is the only antitumor drug on the market that acts by blocking 
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depolymerization of microtubules. In addition, taxol promotes tubulin polymerization 
and inhibits rapidly dividing mammalian cancer cells (99). In 2000, taxol sales 
amounted to over $1 billion and was Bristol Myers-Squibb's third largest selling prod- 
uct (100). Today, the sales have escalated to $9 billion (101). Taxol has antifungal ac- 
tivity by the same mechanism as above, especially against oomycetes (102,103). These 
are water molds exemplified by plant pathogens such as Phytophthora, Pythium, and 
Aphanomyces. 

Epothilone was originally discovered as a myxobacterial product with weak antifun- 
gal activity against rust fungi (104). Later, it was shown to stabilize microtubules, the 
same mechanism as possessed by taxol (105), and to be active against taxol-resistant 
tumor cells (106). Epothilone polyketides are more water soluble than taxol. They are 
produced by Sorangium cellulosum, which is a very slow growing bacterium (16 h 
doubling time) and a low producer (20 ug/mL). The epithilone gene cluster has been 
cloned, sequenced, characterized, and expressed in the faster growing Streptomyces 
coelicolor (107,108). A number of derivatives are in clinical trials. 

A very exciting development has been the attachment of the extremely potent but 
extremely toxic enediyne antitumor drug calicheamicin to a humanized monoclonal 
antibody. The conjugated product Mylotarg™ (orgemtuzumab, ozogamicin) of Wyeth 
was approved for use against acute myeloid leukemia (AML) (109). The monoclonal 
antibody was designed to direct the antitumor agent to the CD-33 antigen, which is a 
protein commonly expressed by myeloid leukemic cells. 

3.2. Immunosuppressive Agents 

Cyclosporin A was originally discovered as a narrow-spectrum antifungal peptide 
produced by the mold Tolypocladium nivenum (previously Tolypocladium inflatum) 
(110). Discovery of its immunosuppressive activity led to its use in heart, liver, and 
kidney transplants, and to the overwhelming success of the organ transplant field. Sales 
of cyclosporin A reached $1 billion in 1994 (111). 

Although cyclosporin A was the only product on the market for many years, two 
other products, produced by actinomycetes, have been very successful. These are FK- 
506 (tacrolimus) (112) and rapamycin (sirolimus) (113), both narrow-spectrum 
polyketide antifungal agents, which are 100-fold more potent than cyclosporin as 
immunosuppressants, and less toxic. Tacrolimus and rapamycin have both received 
FDA approval and are on the market. Tacrolimus was almost abandoned by the 
Fujisawa Pharmaceutical Co. after initial animal studies showed dose-associated toxic- 
ity. However, Thomas Starzl of the University of Pittsburgh, realizing that the immu- 
nosuppressant was 30- to 100-fold more active than cyclosporin, tried lower doses, 
which were very effective and nontoxic, thus saving the drug and many patients after 
that, especially those that were not responding to cyclosporin (114). Since its introduc- 
tion (1993 in Japan; 1994 in the United States), tacrolimus has been used for trans- 
plants of liver, kidney, heart, pancreas, lung, and intestines, and for prevention of 
graft-vs-host disease. Wyeth's sirolimus does not exhibit the nephrotoxicity of 
cyclosporin A and tacrolimus, and is synergistic with both compounds in immunosup- 
pressive action (115). By combining sirolimus with either, kidney toxicity is markedly 
reduced. Owing to a different mode of action, sirolimus has advantages over 
cyclosporin and tacrolimus (114). Although it has been reported that cyclosporin A 
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promotes tumor growth and many transplant patients are killed by tumors, sirolimus 
has the advantage of inhibiting tumor growth by interfering with angiogenesis (116). 
Studies on the mode of action of these immunosuppressive agents have markedly 
expanded current knowledge of T-cell activation and proliferation (117). The 
sirolimus analog everlimus also has immunosuppressive activity, and is in phase III 
clinical trials. Another analog, CCI-779, is in phase II against tumors of the breast, 
renal cell carcinoma, and non-Hodgkin's lymphoma. Sirolimus has found a new use in 
cardiology because sirolimus-impregnated stents are less prone to proliferation and 
restenosis, which usually occur after treatment of coronary artery disease. 

Yeasts and filamentous fungi are inhibited by cyclosporin A, tacrolimus, and 
sirolimus. Susceptible fungi include Candida albicans, Cryptococcus neoformans, 
Coccidiodes immitis, Aspergillus niger, A. fumigatus, and Neurospora crassa 
(118,119). Two nonimmunosuppressive analogs of cyclosporin A are active against 
C. neoformans (120). A nonimmunosuppressive tacrolimus derivative, a C18-hydroxy 
C21-ethyl analog called L-685, 818, inhibits C. neoformans (121). Recently, a topical 
preparation of tacrolimus was shown to be very active against atopic dermatitis, a wide- 
spread skin disease. The ascomycins, structurally related to tacrolimus, have anti- 
inflammatory action and are being clinically examined for topical treatment of atopic 
dermatitis, allergic contact dermatitis, and psoriasis (122). One ascomycin macrolactam 
derivative, SDZ ASM 981, is in clinical studies. Sirolimus, cyclosporin A, and 
tacrolimus are also able to reverse multidrug resistance to antitumor agents in mamma- 
lian cells (123). 

A very old broad-spectrum antibiotic compound, mycophenolic acid, has an amaz- 
ing history. The unsung hero of the story is Bartolomeo Gosio (1863-1944), the Italian 
physician who discovered the compound in 1893 (124). Gosio isolated a fungus from 
spoiled corn, which he named Penicillium glaucum, which was later reclassified as 
Penicillium brevicompactum. He isolated crystals of the compound from culture fil- 
trates in 1896 and found it to inhibit growth of Bacillus anthracis. This was the first 
time an antibiotic had been crystallized and the first time that a pure compound had 
ever been shown to have antibiotic activity. The work was forgotten, but fortunately 
the compound was rediscovered by Alsberg and Black (125) of the US Department of 
Agriculture, and given the name mycophenolic acid. They used a strain originally iso- 
lated from spoiled corn in Italy called Penicillium stoloniferum, a synonym of P. 
brevicompactum. The chemical structure was elucidated many years later by Raistrick 
and coworkers in England (125a). Mycophenolic acid has antibacterial, antifungal, 
antiviral, antitumor, antipsoriasis, and immunosuppressive activities. It was never com- 
mercialized as an antibiotic because of its toxicity, but its 2-morpholinoethylester was 
approved as a new immunosuppressant for kidney transplantation in 1995 and for heart 
transplants in 1998 (126). The ester derivative is called mycophenolate mofetil 
(CellCept) and is a prodrug that is hydrolyzed to mycophenolic acid in the body. 

3.3. Hypocholesterolemic Agents 

High blood cholesterol leads to atherosclerosis, which is a causal factor in many 
types of coronary heart disease and a leading cause of human death. Only 30% of the 
cholesterol in the human body comes from the diet. The remaining 70% is synthesized 
by the body, mainly in the liver (127). Many people cannot control their cholesterol at 
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a healthy level by diet alone, but must depend on hypocholesterolemic drugs. The 
statins inhibit de novo production of cholesterol in the liver. These extremely success- 
ful cholesterol-lowering agents are substituted hexahydronaphthalene lactones. They 
also have antifungal activities, especially against yeasts. Brown et al. (128) discovered 
the first member of this group, compactin (ML-236B), as an antifungal product of P. 
brevicompactum. Independently, Endo et al. (129) discovered compactin in broths of 
Penicillium citrinum as an inhibitor of 3-hydroxy-3-methylglutaryl coenzyme A reduc- 
tase, the regulatory and rate-limiting enzyme of cholesterol biosynthesis. Later, Endo 
(130) and Alberts et al. (131) reported on the independently discovered more active 
methylated form of compactin known as lovastatin (monacolin K; mevinolin) in broths 
of Monascus ruber and Aspergillus terreus respectively. Statins were a success because 
they reduced total plasma cholesterol by 20-40%, whereas the previously used fibrates 
reduced it by only 10-15% (132). 

Lovastatin was approved by the FDA in 1987, when clinical tests in humans showed 
a lowering of total blood cholesterol of 18 to 34%, a 19 to 39% decrease in low-density 
lipoprotein cholesterol ("bad cholesterol"), and a slight increase in high-density lipo- 
protein cholesterol ("good cholesterol"). Another successful derivative is pravastatin, 
which is produced by hydroxylation of compactin using actinomycetes (133,134). The 
market for the statins amounted to $19 billion in 2002 (135). Merck's Zocor, a semi- 
synthetic derivative of lovastatin, had sales of $7.2 billion in 2002, and Pfizer' s syn- 
thetic statin Lipitor had a market of $8 billion, making it the world's leading drug in 
2002. 

Natural statins are produced by many fungi: Aspergillus terreus and species of 
Monascus, Penicillium, Doratomyces, Eupenicillium, Gymnoascus, Hypomyces, 
Paecilomyces, Phoma, Trichoderma, and Pleurotis (136). Although pravastatin is com- 
mercially made by bioconversion of compactin, certain strains of Aspergillus and 
Monascus can produce pravastatin directly (137,138). Statins are not only useful for 
reduction in the risk of cardiovascular disease; they can also prevent stroke, reduce 
development of peripheral vascular disease, and they have antithrombotic and anti- 
inflammatory activities. Statins inhibit production of pro-inflammatory molecules and 
are in clinical trials for multiple sclerosis (139). They also may be of use in other 
autoimmune diseases (140). 

3.4. Enzyme Inhibitors 

In addition to the enzyme inhibitors used to lower cholesterol, another antibiotic has 
succeeded in the world of medicine. Clavulanic acid is a (3-lactam with poor antibiotic 
activity, produced by Streptomyces clavuligerus. It is an inhibitor of penicillinase, and 
is thus included with penicillins in combination therapy of penicillin-resistant bacterial 
infections (141). Clavulanic acid is used as a combination product with amoxycillin 
(Augmentin™) or ticarcillin (Timentin™), and has a market of over $1 billion (142). 
Additional enzyme inhibitors with antibiotic activity are the candins (mentioned 
previously). 

3.5. Other Applications 

A lantibiotic, epidermin, is used for treatment of acne (143). Clindamycin, a semi- 
synthetic derivative of the antibiotic lincomycin, is an effective antimalarial drug, 
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especially when used with quinine (144). Tetracyclines may be useful against prion 
diseases by rendering prion aggregates susceptible to proteolytic attack (145). Prion 
diseases include scrapie of sheep, spongiform encephalopathy of cattle, Creutzfeldt- 
Jakob disease, fatal insomnia, and Gerstmann-Straussler-Scheinker disease in humans 
(146). Both tetracycline and doxycycline reduce infectivity of prions. Although a num- 
ber of other agents are known (e.g., quinacrine, polyanions, polyene antibiotics, 
anthracyclines [iododoxorubicin], chlorpromazine, Congo Red, tetrapyrroles, 
polyamines, antibodies, and certain peptides), these are unable to pass the blood- 
brain barrier and/or are toxic. The tetracyclines can pass through and are nontoxic. 
Clinical trials are planned. Violacein, a toxic antibiotic known for many years as a 
product of Chromobacterium violaceum, is being studied as an agent preventing gas- 
tric ulcers when complexed with (3-cyclodextrin, which decreases its toxicity (147). 

4. Additional Enzyme Combounds 

Desferal is a siderophore produced by Streptomyces pilosus (148). Its high level of 
metal-binding activity has led to its use in iron-overload diseases (hemochromatosis), 
and aluminum overload in kidney dialysis patients. 

Acarbose is a pseudotetrasaccharide that is used as an inhibitor of intestinal oc-glu- 
cosidase in type I and type II diabetes and hyperlipoproteinemia. It is produced by 
Actinoplanes sp. SE50 (149). Acarbose contains an aminocyclitol moiety, valienamine, 
which is responsible for the inhibition of intestinal oc-glucosidase and sucrase. The 
resulting decrease in starch breakdown in the intestine is the basis for its medical use 
against diabetes in humans. 

Another enzyme inhibitor is lipstatin, which is used to combat obesity and diabetes 
by interfering with gastrointestinal absorption of fat. Lipstatin is produced by Strepto- 
myces toxytricini as a pancreatic lipase inhibitor. This lipase is involved in digestion of 
fat (150). Orlistat, the commercial product, is tetrahydrolipstatin. 

5. Mycotoxins As Sources of Useful Agents 

It is difficult to accept the fact that even poisons can be harnessed as medically 
useful drugs, yet this is the case with the ergot alkaloids. However, there is a philoso- 
phy in Traditional Chinese Medicine (TCM) of "using poison against poison." In the 
18th century, Dr. William Withering found that one of his patients with dropsy (con- 
gestive heart failure) improved remarkably after using a traditional herbal remedy. He 
discovered the active ingredient digitoxin (digitalis) in the leaves of foxglove. In con- 
temporary medicine, digitalis is used to strengthen cardiac diffusion and regulate heart 
rhythm. Dr. Withering commented, "Poisons in small doses are the best medicine; and 
the best medicines in too large doses are poisons" (150a). 

The mycotoxins (toxins produced by molds) were responsible for fatal poisoning of 
humans and animals (ergotism) throughout the ages after consumption of bread made 
from grain contaminated with species of Claviceps. In the Middle Ages, the ergot alka- 
loids caused the disease known in Europe as "Holy Fire" or "St. Antony's Fire." This 
widespread epidemic disease produced gangrene, cramps, convulsions, and hallucina- 
tions. These early names of the disease relate to the care of patients by the monks of the 
Antoniter Brotherhood. A major epidemic occurred in the USSR during the famine of 
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1926-1927. It is amazing but true that these "poisons" are now used for angina pecto- 
ris, hypertonia, migraine headache, cerebral circulatory disorder, uterine contraction, 
hypertension, serotonin-related disturbances, inhibition of prolactin release in 
agaloactorrhea, reduction in bleeding after childbirth, and for prevention of implanta- 
tion in early pregnancy (151,152). Among their physiological activities are the inhibi- 
tion of action of adrenalin, noradrenalin, and serotonin, and the contraction of smooth 
muscles of the uterus. Some of the ergot alkaloids possess antibiotic activity. 

6. Combinatorial Biosynthesis 

Many new products have been made by genetic methods involving modification or 
exchange of genes between organisms to create hybrid molecules; the technique is 
known as combinatorial biosynthesis (153-155). Recombinant DNA (rDNA) methods 
are used to introduce genes coding for natural product synthetases into producers of 
other natural products or into nonproducing strains to obtain modified or hybrid antibi- 
otics. The first demonstration of this involved gene transfer from a streptomycete strain 
producing the isochromanequinone antibiotic actinorhodin into strains producing 
granaticin, dihydrogranaticin, and mederomycin (which are also isochromanequinones). 
This led to the discovery of two new antibiotic derivatives, mederrhodin A and 
dihydrogranatirhodin. Since this breakthrough paper by Hopwood et al. (156), many 
hybrid antibiotics have been produced by rDNA technology. New antibiotics can also 
be created by changing the order of the genes of an individual pathway in its native 
host (157). Many clusters of natural product genes are modular and produce multifunc- 
tional enzymes with a high degree of plasticity. By interchanging genes within these 
clusters, hybrid enzymes can be produced that are capable of synthesizing "unnatural 
natural products" (157a). 

Thousands of new antibiotics have been made, including erythromycins (28,158- 
164), spiramycins (165,166), tetracenomycins (167,168), anthracyclines (169-172), 
and nonribosomal peptides such as modified surfactins (173). 

7. Closing Remarks 

Going from the three-dimensional crystal structure of a protein to designing of a 
drug is very difficult, because many proteins with similar structures have entirely dif- 
ferent functions (174). For example, the three-dimensional structure of triosephosphate 
isomerase, resembling a barrel or a bagel, is found in 1 out of every 10 enzymes that 
catalyze very different reactions. Also many proteins having similar functions, e.g., 
L-aspartate aminotransaminase and D-amino acid aminotransferase, have no identity 
of sequence and different folds of their peptide chains, yet they both use the cof actor 
pyridoxal phosphate and catalyze transamination. Furthermore, one protein often cata- 
lyzes several different reactions. These facts suggest that screening for new drugs is a 
process that must continue into the foreseeable future. 

Thirty-five years ago, there was a drop in interest in screening natural products as a 
result of the difficulty in finding new antibiotics, lack of interest on the part of the 
government to fund such work, and huge investments by the pharmaceutical industry 
in the emerging area of rDNA. Fortunately, a few companies remained in the game, 
wisely incorporating the newer knowledge of genetics with natural products screening. 
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Not only were new antibiotics discovered and developed, but there was a broadening of 
the search to include nonantibiotic applications of natural products. No longer were 
microbial sources looked upon solely as potential solutions for infectious diseases, but 
for other applications, such as cholesterol-lowering, immunosuppresion, enzyme inhi- 
bition, and so on. The change in screening philosophy was accompanied by ingenious 
applications of molecular biology to detect receptor antagonists and agonists, and other 
agents inhibiting or enhancing cellular activities on a molecular level (175). 

Natural products are unsurpassed in their ability to provide novelty and complexity 
(22). With respect to the number of chirality centers, rings, bridges, and functional 
groups in the molecule, natural products are spatially more complex than synthetic 
compounds (6). Synthetic compounds highlighted via combinatorial chemistry and in 
vitro high-throughput assays are based on small chemical changes to existing drugs, 
and of the thousands, perhaps millions, of chemical "shapes" available to pharmaceuti- 
cal researchers, only a few hundred are being explored (176). 

About $48 billion is spent annually on new drug development (177). Of this amount, 
$14 billion is spent on the drug discovery phase. The total research and development 
(R&D) annual spending of the top 50 pharmaceutical companies was $1.9 billion in 
1978-80 and rose to $29 billion by 2000 (1 78) ! The industry as a whole was reported to 
have spent $21 billion for R&D in 1998 and $32 billion in 2002 (179). However, an- 
other report put the R&D spending of the top 20 pharmaceutical companies at $37 
billion in 2001 (180). 

Clinical development time has doubled since 1982 to 5-2/3 years, and the total time 
to get a drug on the market is 12-15 yr (181,182). It may take even longer, as judged 
from the following estimate: 2-10 yr for discovery, 4 yr for preclinical testing, 1 yr of 
phase I (involving 20-30 healthy volunteers for safety and dosage), 1.5 yr for phase II 
(100-300 patient volunteers for efficacy and side effects), 3.5 yr for phase III (1000- 
5000 patient volunteers monitoring effects of long-term use), 1 yr of FDA review and 
approval, and 1 yr of postmarketing testing (183). The total time can thus be 14 to as 
long as 22 yr. The time commitment has not changed since 1999, but the estimated cost 
of doing this rose from $500-600 million to $900 million (179). Two-thirds of the cost 
is spent on leads that fail in the clinic (36). One-half of all potential drugs fail because 
of adsorption, distribution, metabolism, excretion, or toxicity (ADME/TOX) problems. 

A few years ago, it was thought that combinatorial chemistry and high-throughput 
screening (HTS) would yield many new hits and leads, but the result has been disap- 
pointing, despite the extraordinary amount of money spent (184,185). After it was 
developed in the early 1990s, HTS methods achieved speed and miniaturization but 
discovery of new leads did not accelerate. HTS methods allowed 100,000 chemicals to 
be assayed per day, and combinatorial and other chemical libraries of 1 million com- 
pounds were available commercially. Despite this, no drugs had been approved that 
resulted from HTS by 1999 (186). Since 1998, R&D spending by the top 20 pharma- 
ceutical companies increased from $26 billion in 1998 to $37 billion in 2001, but the 
number of NDAs decreased from 34 to 16. Whereas FDA drug applications (total of 
NDAs, INDs, orphan drug applications, and so on) peaked at 131 in 1996, the number 
dropped steadily to 78 in 2002 (187). In 2001, there was a 20-yr low in the number of 
new active substances approved by the FDA (188). The number was 37, and was part 
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of a continuous drop since 1997. During 1978-1980, the average number of new chemi- 
cal entities launched by the pharmaceutical industry was 43; in 1998-2000, the num- 
ber had dropped to 33. Only 17 drugs were approved in 2002, compared to 32 or more 
per year in the late 1990s (189). 

The advent of combinatorial chemistry, HTS, genomics, and proteomics has "not 
yet delivered the promised benefits" (183). Investment in genomics and HTS has had 
no effect on the number of products in preclinical development or phase I clinical 
trials. The problems are that HTS has not been applied to natural product libraries, and 
combinatorial chemistry has not been applied to natural product scaffolds (190-192). 
Natural product collections have a much higher hit rate in high-throughput screens 
than do combinational libraries (193). Breinbauer et al. (194) pointed out that the num- 
bers of compounds in a chemical library are not the important point; it is the biological 
relevance, design, and diversity of the library, and that a scaffold from nature provides 
viable, biologically validated starting points for the design of chemical libraries. Accord- 
ing to Sam Danishefsky, prominent synthetic chemist at Memorial Sloan-Kettering Can- 
cer Center in New York, it is appropriate "to critically examine the prevailing supposition 
that synthesizing zillions of compounds at a time is necessarily going to cut the costs of 
drug discovery or fill pharma pipelines with new drugs any time soon" (195). 

Some companies have dropped the screening of their natural product libraries because 
they considered that such extracts were not amenable to HTS (186). Even worse, we hear 
that combinatorial chemistry is replacing natural product efforts for discovery of new 
drugs, and that most companies have even dropped their natural product programs to 
support combinatorial chemistry efforts. This makes no sense, since the role of combi- 
natorial chemistry, like those of structure-function drug design and recombinant DNA 
technology two and three decades ago, is that of complementing and assisting natural 
product discovery and development, not replacing them (196). Instead of downgrading 
natural product screening, there is real opportunity in combining it with HTS, combi- 
natorial chemistry, genomics, proteomics, and new discoveries being made in 
biodiversity. 

Genomics will provide a huge group of new targets against which natural products 
can be screened (197). By 2003, there were 79 sequenced genomes of bacteria (198). 
Useful targets for antibacterial therapy revealed by genomics are (1) two-component 
signal transduction systems, (2) FtsA-FtsZ interaction for cell-division inhibition, (3) 
MurA for cell-wall synthesis inhibition, (4) chorismate biosynthesis, (5) isoprenoid 
biosynthesis, and (6) fatty acid biosynthesis (199). Additional new targets for further 
discovery efforts include bacterial signal peptidases (200), non-(3-lactam inhibitors of 
(3-lactamase, lipid A biosynthesis, tRNA synthetases (201), DNA replication (helicase, 
encoded by dnaB; DNA gyrase subunit B, encoded by gyrB), enoyl-ACP reductase, 
protein secretion, intermediary metabolism (dihydrofolate reductase encoded by folA; 
UMP kinase encoded by pyrH), translation (methionyl-tRNA synthetase encoded by 
metG; elongation factor Tu encoded by tufA[BJ). E. coli strains with low levels of such 
enzymes have been constructed; they are hypersusceptible to specific inhibitors of each 
target and useful for whole-cell assays of new antibacterials (202). 

A genomic comparison of the pathogenic Haemophilus influenzae with a nonpatho- 
genic E. coli revealed 40 potential drug targets in the former (203). Similarly, a com- 
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parison of genomes of Helicobacter pylori with E. coli and H. influenzae revealed 594 
H. pylori-specific genes, of which 196 were known, 123 of the known genes being 
involved in known host-pathogen interactions and 73 targets of novel potential (204). 
Bacterial pathogens contain about 2700 genes, of which current antibiotics target less 
than 25 (205). When the S. cerevisiae genome sequence was announced in 1996, the 
functions of only one-third of its 6200 predicted genes were known (206). By 2002, 
some 4400 yeast genes were characterized (207). Of the genes of the S. cerevisiae 
genome, about a dozen are potential targets: chitin synthetase, (3-1,4-glucan synthetase, 
tubulin, elongation factor 2, ,/V-myristoyl transferase, acetyl-CoA carboxylase, inositol 
phosphoryl ceramide synthase, membrane ATPase, mannosyl transferase, tRNA syn- 
thetases, lanosterol dehydrogenase, and squalene epoxidase (208). 

Although the performance of the pharmaceutical industry has been dismal recently 
because of poor decisions, the biotechnology industry is doing very well. Between 
1997 and 2002, 40% of the drugs introduced came from biotechnology companies. 
The five largest pharmaceutical companies have in-licensed from 6 to 10 products 
from biotechnology or specialty pharmaceutical companies, yielding 28-80% of their 
revenue. The biotechnology industry had two drug/vaccine approvals in 1982, none in 
1983-1984, one in 1985, rising to 32 in 2000! The number of patents granted to bio- 
technology companies rose from 1500 in 1985 to 9000 in 1999. Some biotechnology 
companies are entering the area of natural product screening and, in the end, may save 
this valuable resource from falling into obscurity. 
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Summary 

Historically, nature has provided the source for the majority of the drugs in use today. This owes 
in large part to their structural complexity and clinical specificity. However, only a small percentage 
of known microbial secondary metabolites have been tested as natural-product drugs. Natural-prod- 
uct programs need to become more efficient, starting with the collection of environmental samples, 
selection of strains, metabolic expression, genetic exploitation, sample preparation and chemical 
dereplication. A renaissance of natural products-based drug discovery is coming because of the trend 
of combining the power of diversified but low-redundancy natural products with systems biology and 
novel assays. This review will focus on integrated approaches for diversifying microbial natural- 
product strains and extract libraries, while decreasing genetic and chemical redundancy. Increasing 
the quality and quantity of different chemical compounds tested in diverse biological systems should 
increase the chances of finding new leads for therapeutic agents. 

Key Words: Diversity; microbial natural products; drug discovery; redundancy; dereplication; 
synergy. 

1. Introduction 

The most well-known examples of natural product are antibiotics (1 ). The "Golden 
Age of Antibiotics," from the 1940s to the 1970s, was sparked by the serendipitous 
discovery of penicillin by Alexander Fleming in 1928 and its development by Chain 
and Florey in the 1940s. Another remarkable milestone in the medicinal use of micro- 
bial metabolites and their derivatives was the introduction of the immunosuppressants 
cyclosporin A, FK-506 (2), and rapamycin (2,3). Other examples are the commercial- 
ization of the antihyperlipidemic lovastatin and the recent discovery of guggulsterone 
(4). Microbial natural products have also been developed as antidiabetic drugs, hor- 
mone (ion-channel or receptor) antagonists, anticancer drugs, and agricultural and phar- 
maceutical agents (5). Microorganisms not only produce secondary metabolites that 
affect cell growth, but also accumulate bioactive compounds that interact with valu- 
able targets of cell metabolism and signaling that are not directly correlated with cell 
death (6). 

Drug discovery strategies for pharmaceutical and agrochemical applications are in a 
revolutionary period (7). The completion of the Human Genome Project and the eluci- 
dation of dozens of microbial pathogens genomes have provided thousands of disease- 
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related targets for screening. Automated instrument systems, robots, high-throughput 
screening (HTS) platforms, and high-throughput chemistry have provided powerful 
tools for screening large compound libraries in a cost-effective manner. Combinatorial 
chemical, synthetic chemical, and natural-product libraries provide abundant resources 
for target-based screening. The success and failure of drug discovery is coupled to the 
novelty and meaningfulness of the applied biological test systems as well as the amount 
and structural diversity of the test compounds available (8). 

In the early to mid-1990s, combi-chem companies attempted to fill the void with 
large numbers of new molecules. Unfortunately, it appears that the chemistry employed 
did not create sufficiently diverse or pharmacologically active molecules. It is clear 
that the future success of the pharmaceutical industry depends on the combining of 
complementary technologies such as natural-product discovery, HTS, genomics and 
proteomics, combinatorial biosynthesis, and combinatorial chemistry. The process of 
drug discovery for therapeutic and preventive medicines is facilitated by increasing 
knowledge of biological mechanisms, such as treatment efficiency, potential side effects, 
and the growing threat of drug resistance. A large amount of disease-relevant protein 
targets have been identified and validated from genomics, proteomics, and systems/ 
computational biology approaches. Novel targets and novel HTS assays and measure- 
ment systems are emerging to allow more sensitive, reliable, and low-background 
searches for new potential drugs among natural products. 

The advantages and challenges of natural product-based drug discovery as com- 
pared to its synthetic chemistry counterpart are summarized as follows: 

Advantages: 

• Natural products offer unmatched chemical diversity with structural complexity and bio- 
logical potency (9). 

• Natural products have been selected by nature for specific biological interactions. They 
have evolved to bind to proteins and have drug-like properties. 

• Natural products are a main source of pharmacophores. Drugs such as cyclosporin A and 
FK-506 are not only active as immunosuppressants but also as antiviral, antifungal, and 
antiparasitic agents. 

• Natural-product resources are largely unexplored. Novel discovery strategies will lead to 
novel bioactive compounds. Natural-product extracts are complementary to synthetic and 
combinatorial libraries. About 40% of natural-product diversity is not represented in syn- 
thetic compounds libraries. 

• Natural products can guide the design of synthetic compounds (10). 

• Research on natural products has led to the discovery of novel mechanisms of action — for 
example, those of immunosuppressants and guggulsterone (4), 

• Natural products are powerful biochemical tools; they serve as "pathfinders" for molecu- 
lar biology and chemistry, and in the investigations of cellular functions. 

Main challenges: 

• The lack of systematic exploitation of ecosystems for the discovery of novel microbial 
compounds had resulted in random sampling and has missed the true potential of many 
regions (77). 

• Little effort has focused on the isolation and cultivation of less culturable microorgan- 
isms. The discrepancy between the number of microbes detected by molecular methods 
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and the number of strains in culture, demonstrates that there remains a relatively untapped 
source of novel strains in all ecosystems (12). 

• The selection of strains has traditionally been based on morphology, rather than on the 
more powerful approaches of chemical diversity and genotype (13). 

• Lack of dereplication has resulted in a redundancy of strains and compounds within many 
natural-product extract libraries (14). 

• The characterization and isolation of active compounds from natural-product extracts are 
extremely labor intensive and time consuming (15). 

• The production of adequate quantities of the active compound needed for drug profiling 
may require extensive media optimization and scale-up (16). 

This review concentrates on the challenges of efficiently diversifying a library of 
microbial compounds, and does not deal with other problems such as preparation of 
samples, scale-up, chemical identification, and so on. Systematic approaches to maxi- 
mize the biodiversity of microorganisms within a natural-product library are discussed 
from the following three perspectives: (1) isolation and selection of samples from 
diverse ecosystems; (2) manipulating microbial physiology to activate microbial 
natural-product biosynthetic machinery; and (3) genetically modifying strains for pro- 
duction of unnatural microbial natural products. By manipulating all three of these 
approaches, the diversity of an extract collection can be maximized, and in doing so, 
the chance of finding a "hit" can be increased. The quality of a microbial natural- 
product library is built on the dynamic equilibrium between diversification and reduc- 
ing redundancy of microbial natural products. Therefore, strategies for obtaining high 
quality of a microbial natural-product library are discussed here. 

2. Sample Collection and Selection From Diversified Ecosystems 

Existing microbial natural-product drugs were originally isolated from all over the 
Earth. Microbes can sense, adapt, and respond to their environment quickly and help 
compete for defense and survival by generating unique secondary metabolites. These 
compounds are produced in response to stress. In diversifying microbial natural-prod- 
uct extract libraries, the greatest influence will undoubtedly be the genetic diversity of 
strains. By maximizing the types of samples collected and diversifying the isolation 
strategies, a highly diverse microbial collection can be generated. 

2. 1. Ecosystem Rationale 

Collecting environmental samples for isolation of interesting microorganisms has 
often been conducted without defined strategies (17). Such programs need to take into 
consideration the biogeography of ecosystems, number of samples collected, and iso- 
lation procedures. It is important to increase the number and diversity of sampling sites 
(18), and it is especially important to look at underrepresented sites. Diverse regions 
such as the deep subsurface, the deep sea, and sites that have extreme temperature, 
salinity, or pH often generate novel microorganisms and therefore provide the poten- 
tial for novel compounds (19). Temperate ecosystems should not be excluded, because 
they also have the potential to provide many novel species, especially when novel iso- 
lation strategies are used. Cyclosporins, rapamycin, penicillin, and rifamycin, among 
others, were isolated from microorganisms collected in temperate regions. 
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It is still debated whether most microorganisms are cosmopolitan or endemic to 
specific geographic areas. There is a lack of detailed information in the field of geo- 
graphical distribution of microorganisms (19). In some cases, the presence of an endemic 
species can be detected; for example, several groups of bacteria appeared to be endemic 
to an ice microbial community (20). However, the definition of a microbial species is 
difficult, especially for prokaryotes, which exchange parts of their genomes with suffi- 
cient ease to make difficult the biologically meaningful definition of a species (21). In 
order to increase the chance of constructing a library of microorganisms with high 
diversity, the first step is to consider different geographic areas, including biodiversity 
hot spots (22). There are more compounds in nature than possible molecular targets. 
One should concentrate on sampling in regions with different climates, fauna, and flora. 

It is important to analyze properly the various ecosystems of a region. For example 
in Massachusetts, there are 13 eco-regions, from the Berkshire Highlands to the coastal 
regions of Cape Cod (http://www.state.ma.us/mgis/eco-reg.html), each of which has 
various subecosystems. Microbiologists should work closely with botanists and ecolo- 
gists to obtain as many different samples and microorganisms as possible from one 
ecosystem to maximize the likelihood of finding novel strains and in turn novel chemi- 
cals. In almost all ecosystems, no matter how harsh, a group of organisms will grow 
and thrive. In these unique sites, we can expect to find unique metabolic pathways that 
have evolved to allow microorganisms to adapt and survive. 

With the discovery that microbial symbionts were driving the metabolism of 
tubeworms in deep-sea hydrothermal vents, it was realized that an oasis of rich diver- 
sity could be found even in areas that were thought to be devoid of life. The deep sea, 
one such ecosystem, is actually a rainforest with a diversity of more than 10 million 
species, more than 60% of which are unknown (23). In addition to the open ocean, 
there are diverse and dynamic areas such as mangroves, coral reefs, hydrothermal vents, 
and deep-sea sediments in which to search for microbes. Natural products have been 
isolated from marine invertebrates such as sponges, tunicates, mollusks, and bryozoans 
(24). This not only demonstrates the numerous opportunities the oceans provide for 
discovering new compounds, but also validates the pharmacological value of exploring 
the oceans for novel compounds. There are some concerns about the isolation of marine 
microbes. Some researchers claim that this resource is not thoroughly explored because 
these organisms are hard to maintain in the laboratory environment. However, one suc- 
cessful case was the recent discovery of a new genus of actinomycetes found only in 
the marine environment, i.e., Salinospora (25). One isolate produces salinosporamide 
A, a potent anticancer agent. Thus, the oceans can no longer be ignored (26). Other 
regions that warrant further study are locations with extremes of pH, temperature, and 
salinity. 

A defined sampling strategy must be adopted. To comprehensively explore a par- 
ticular site, multiple discrete locations within the site must be sampled. Many types of 
samples should be selected in one ecosystem — e.g., soils, sediments, organic material, 
dung, dead animals, plants, and lichens. Soil still remains an important source because 
it carries higher populations of microbes than any other habitat (27). DNA community 
analysis has proven that the number of types of organisms found within a microbial 
community is much higher than previously thought. One analysis of the reassociation 
kinetics of total bacterial DNA in a 30-g soil sample suggested that it contained more 
than 500,000 species (21). 
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Plants and lichens offer niches for interaction between microorganisms and eukary- 
otic cells. Many natural products initially isolated from plants and animals were actu- 
ally produced by microbial symbionts found within the tissue of the host (23). In some 
incidences where the microorganism or symbiont could not be cultured axenically, 
the genes responsible for the production of the active compounds were attributed to 
the microorganism (28). 

Endophytes are microorganisms including unicellular bacteria (29), actinomycetes 
(30), and fungi (31) that spend part or all of their life cycle colonizing, either inter- or 
intracellularly, the healthy tissue of a plant (32). Almost all vascular plants and mosses 
examined so far have endophytic bacteria or fungi within their tissues (29). The num- 
ber of strains found within the plant tissue can vary from a few to several hundred per 
plant. This relationship between the microorganism and the plant can range from mildly 
phytopathogenic to symbiotic. Endophytes produce a range of compounds (16), some 
of which help the plant to survive and thrive in its ecosystem and some that help it fight 
off infection (33). Plants therefore provide an obvious source for isolation of microor- 
ganisms that could potentially produce novel natural products. Of special interest are 
the large number of alkaloids and taxol produced by endophytic fungi (16). 

Lichens, symbiosis between fungi and cyanobacteria, are another source of microor- 
ganisms living in a unique and competitive environment. In each lichen sample, the 
fungus forms a thallus or lichenized stoma (34,35). Furthermore, in addition to the 
symbiotic fungal strains, other fungi and bacteria live as endophytes inside the lichens, 
or as epiphytes on the lichens. The fungi within the lichen often produce unique sec- 
ondary metabolites. Over 800 lichen secondary metabolites have been collected so far. 

2.2. Isolation Strategies 

Microbial diversity in the environment is far greater than reflected in most strain 
collections, due to the number of organisms that cannot be cultured using standard 
culture conditions (36). Therefore, a vast majority of microorganisms in many samples 
remain unexplored. Molecular techniques allow detection of organisms that were 
missed using culture-dependent methods. Culture-independent methods, such as DNA 
clone libraries, have allowed identification of vast numbers of new organisms that are 
different from anything previously cultured. It is estimated that as few as 0.1-1% of the 
organisms living in the biosphere have been cultured and characterized in the labora- 
tory setting. In one study, approx 10 7 bacteria were counted in 1 g of soil (37), but as 
few as 0.1% were culturable using standard culture techniques. The other 99.9 % of the 
population may represent novel genetic diversity (14), and may produce novel natural 
compounds. In 1987, when Dr. Carl Woese (116) proposed the five-kingdom phyloge- 
netic tree, the bacteria were divided into twelve groups. The initial evaluation was done 
primarily with bacteria in culture. By 2000, the number of groups had expanded to 36, 
of which 13 do not have a representative in culture (38). Approximately 6000 bacterial 
species have been described, but the number of bacteria that exist in nature is predicted 
to be as high as 600,000 (39). The situation may even be more extreme for fungi. The 
currently accepted number of described species of fungi is 72,000, but the estimated 
figure of fungi that exist in nature is 1.5 million. This suggests that there are diverse 
novel microorganisms in the natural environment that could be used as sources for 
drug discovery. The argument against putting effort into culturing less-culturable 
organisms is that it is very time consuming and the techniques used for one organism 
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may not be applicable to others. With this in mind, several biotech organizations are 
now seeking means to harness the potential of these less-culturable strains. 

Microbial community analysis has revealed that the microorganisms in culture not 
only represent a small part of the population, but may not be the most prevalent in the 
natural environment (12). It is expected that one of the largest efforts in the next decade 
will be exploring means to culture less-culturable organisms. It is thought that the rea- 
son for the enormous discrepancy between the total viable cell counts and those of 
culturable cells may be due to the following: (a) cell damage by oxidative stress, (b) 
formation of viable but nonculturable cells, (c) inhibition by high substrate concentra- 
tions, (d) induction of lysogenic phages upon starvation, and (e) lack of cell-to-cell 
communication in laboratory media (41). Two main approaches have been used to 
enhance the resuscitation of less-culturable strains. The first is the addition of cell- 
signaling molecules and the second is the use of oligotrophic isolation media. 

Microorganisms use pheromones to communicate with each other, both within and 
across species (42). Microorganisms may require signaling from other organisms in 
order to grow, even if provided with the appropriate nutrients. The addition of growth 
factors to culture medium has been used successfully to increase the resuscitation of 
greater numbers of microorganisms and thus higher microbial diversity. The addition 
of pyruvate or catalase to reduce oxidative stress during isolation can increase the num- 
bers and diversity of strains isolated (43). The addition of cyclic AMP (44) and ,/V-acyl- 
homoserine lactones have both been shown to increase the resuscitation of starved 
cultures under laboratory conditions (41). In enterobacteria, cAMP is involved in the 
regulation of the majority of the genes expressed under starvation, including those 
coding for high-affinity sugar-transport systems (45). 

A second approach for increasing the resuscitation of less-culturable strains is the 
use of oligotrophic isolation media. It has been well documented that conventional 
media have extremely high concentrations of complex organic compounds compared 
with those present in the natural environment. Most isolation media allow for growth 
of only a selected group of strains and inhibit the majority of the natural population. 
Oligotrophic media not only allow the growth of less-culturable microorganisms but 
also prevent the overgrowth of fast-growing "microbial weeds" (46). Using unamended 
site water as a growth medium, unique populations of microorganisms have been cul- 
tured (47). A variation of this method is encapsulation of single cells within gel 
microdrops that contain low-nutrient media (46) or within specialized growth cham- 
bers incubated in site water (42). 

One argument against applying less-culturable strains to a drug-discovery program 
has been that they could not be cultured at a high enough cell density. The argument for 
including them is that although they may initially require the addition of growth factors 
or oligotrophic growth conditions, there is evidence that once cultured, the organisms 
can be grown out in nutrient-rich media. Using 960 cells cultured in microdrops, 67% 
of the cultures were able to grow to densities of >10 7 cells/mL (46). This allows the 
cells to be cultured in a manner that could be easily applied to drug-discovery plat- 
forms. 

An alternate approach to access unculturable microorganisms is to clone the DNA 
directly from uncultured microorganisms (see Subheading 3.1.). 
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3. Manipulating Microbial Physiology to Activate Microbial 
Natural-Product Machinery 

In order to exploit the true potential of microorganisms, the physiological growth 
conditions used for generating extracts need to be diversified. And microbial metabo- 
lism can be influenced to produce qualitatively and quantitatively different chemical 
compounds. The physiology of secondary metabolism has often been neglected. Very 
few of the regulatory features of secondary metabolism have been elucidated (48). The 
global situation in physiological regulation is very complex, as a result of the variety of 
microbes, the variety of biosynthetic pathways, and the variety of controls. Environ- 
mental conditions remain, however, a key element in the discovery and production of 
secondary metabolites. Strategies have to be developed in order to exploit the full meta- 
bolic potential of each microorganism in order to maximize chemical diversity. Bio- 
chemical pathways, induction, and regulation of secondary metabolism by internal 
molecules (such as the autoregulators) have been reviewed previously (49,50). 

3. 1. Various Optima 

The optimal conditions for biosynthesis of secondary metabolites are usually not 
identical to the ones for growth. In general, the optimal zones are narrower for second- 
ary metabolite production. Physiological regulations vary with different microorgan- 
isms and different metabolic pathways. The qualitative and quantitative aspects for 
secondary metabolite production must be taken into consideration. 

There are usually differences between the optimal carbon sources for growth and 
those that are good for secondary metabolism (51). For example, glucose is an excel- 
lent carbon source for growth in most cases, but depresses the production of a series of 
secondary metabolites such as actinomycin, cephalosporin, ergot alkaloids, and tylosin. 
However, glucose does not interfere with the production of aflatoxin, aminoglycosides, 
or chloramphenicol (52), and the production of anticapsin by Streptomyces griseoplanus 
is maximal at a concentration of glucose as high as 100 g/L (53). 

Secondary metabolic pathways are often negatively affected by nitrogen sources 
favorable to rapid growth. For example, ammonium salts inhibit the production of 
cephamycin, fusidin, and rifamycin (54); however, some biosynthetic pathways are not 
affected, such as that for pyrroindomycins in Streptomyces rugosporus (55). Optimal 
production of gibberellic acid by Gibberella fujikuroi in a defined medium requires a 
concentration of 22.5 mM ammonium sulfate. Complex natural sources of nitrogen 
such as soybeans and casamino acids are also good. The influence of amino acids on 
secondary metabolite production is very variable and can depend on the precursor or 
the natural inducer. Inorganic phosphate suppresses the synthesis of many secondary 
metabolites. Thus, the optimal phosphate concentration needed for production of sec- 
ondary metabolites is generally lower than that required for growth. However, the 
optimal concentration can vary drastically between strains. The concentration can be 
as low as 0.08 mM for the synthesis of bacitracin by Bacillus licheniformis , or as high 
as 8 mM for the production of novobiocin by Streptomyces griseus (56). In some inci- 
dences, high phosphate concentrations can even induce the biosynthesis of some metabo- 
lites (57). Secondary metabolism often requires trace elements such as iron, zinc, and 
manganese. Once again, the optimal concentrations vary from process to process, but 
often range from less than 0.1 to 1 x 10" 3 M (58). 
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Optimal temperatures for the production of secondary metabolites are, in general, 
lower than for growth, but can vary considerably. For example, a temperature of 21°C 
is optimal for biosynthesis of cyclosporin by Tolypocladium inflatum, 25°C is optimal 
for the synthesis of streptomycin by Streptomyces griseus, and 28 °C is best for 
nebramycin formation by Streptomyces tenebrarius. Most of the known secondary 
metabolites are produced under standard aeration conditions, but some require lower 
and some higher dissolved oxygen concentrations (59). Extremely high aeration is 
required for optimal production of secondary metabolites by Streptosporangium (60). 

Incubation time is another key point and is dependent on the growth characteristics 
of the microorganism and the culture conditions. For example, actinomycetes can vary 
from 3 d for the maximum production of arylomycins by a strain of Streptomyces (61) 
to 12 d for maximum production of pramicidin S by a strain of Actinomadura (62). The 
addition of adsorbents such as XAD-16 resin to liquid cultures can also enhance the 
concentration of secondary metabolites produced (63). Most programs for the discov- 
ery of novel metabolites from microorganisms use liquid shaken cultures for cultiva- 
tion of microorganisms. This provides an easy and well-controlled system. Solid-phase 
fermentation allows the biosynthesis of other metabolites, mainly related to the sporu- 
lation process (64). Both types can be scaled up effectively (65). 

3.2. Selection of Culture Conditions 

The optimal conditions for secondary metabolite production vary from microbe to 
microbe. The composition of media and the culture conditions have a great impact on 
the production of secondary metabolites. In a discovery program, one is working with 
a large series of unique and ubiquitous microorganisms. Multiple conditions are neces- 
sary in order to allow the expression of secondary metabolites. Both static and shaken 
liquid cultures should be incorporated. Different incubation temperatures must be cho- 
sen. Addition of elicitor compounds such as heavy metals, oils (66), microbial or fun- 
gal cell-wall components, and dimethyl sulfoxide (DMSO) (67) can increase the 
biosynthesis of certain secondary metabolites. The media can include carbon and nitro- 
gen sources at different concentrations, as well as other nutrients such as phosphate 
and trace elements or elicitors at various levels, and using a Greco-Latin square format. 
The goal is to have a good ratio between the number of strains (genetic diversity) and 
the number of culture conditions (metabolic expression) for each microbe in one col- 
lection. 

Another powerful tool is the preselection of strains based upon growth in a series of 
media in microcultures or in small vials (68). A standard format that is amenable to 
automation allows more than 20 media to be easily tested with each microorganism, at 
different temperatures, and in liquid as well as on solid media. This allows the selec- 
tion of the best conditions for each strain, which can then be used to scale up to get 
larger volumes required for initial structure determination. This system is also adequate 
for a quick optimization program. 

For each group of strains, a series of conditions can be chosen incorporating both 
shaken liquid cultures and stationary solid cultures. Typically, five media types are 
used for each batch of cultures, and three to five incubation conditions are chosen to 
include various temperatures. In order to enhance the chance of success in such a ran- 
dom process, the conditions used for one group must be rotated. The results of chemi- 
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cal profiling and scores in screening should be constantly analyzed in order to improve 
the system. 

3.3. Physiological Exploitation of Talented Strains 

Some microorganisms are able to produce a variety of compounds from different 
chemical families and are termed metabolically talented (69). Most of the recently 
described microbial compounds are produced by actinomycetes, mostly Streptomyces 
strains, and by saprophytic filamentous fungi. One metabolically talented microorgan- 
ism, Streptomyces sp. strain Go.40/10, synthesizes at least 30 different secondary 
metabolites, many of which are new compounds (70). Some strains can synthesize 
more than 50 compounds, which can be detected only by classical chemical methods. 
Myxobacteria and fungi are also considered to be talented microbes (71). 

The genomes of actinomycetes (8 Mb) (72), fungi (13-42 Mb) (73), and myxobacteria 
(12 Mb) (74) are much larger than needed for all basic functions. Therefore, it is widely 
thought that part of the genome may contain genes for alternative metabolic pathways. 
For example, Streptomyces coelicolor A(3)2 is designated as a potent producer of sec- 
ondary metabolites. It produces methylenomycin, prodigiosin, actinorhodin, and a cal- 
cium-dependent antibiotic. In addition, several formerly unknown gene clusters 
(polyketide syntheses type I and II, nonribosomal peptide synthases) have been found 
in its genome (75). In Streptomyces avermitilis ATCC 31267, the producer of 
avermectin, 24 additional gene clusters have been sequenced (76). Genomic data have 
suggested that the myxobacterium Stigmatella aurantiaca DW4/3-1 has a much 
broader capacity to produce a much broader group of natural products than those iso- 
lated to date from this organism (77). Genes responsible for production of many com- 
pounds can be found in the genomes of nonproducing strains. The questions arise: Are 
these genes nonfunctional? Are the detection methods not powerful enough? Are these 
genes not expressed under standard growth conditions? Do the genes require external 
signaling to turn them on? More efficient detection methods, biochemical assays such 
as capillary electrophoresis (78), or chemical methods will allow the discovery of large 
numbers of novel compounds (79). As briefly described in Subheading 3.1. small 
changes in the culture conditions can have a major influence in the spectrum of sec- 
ondary metabolites synthesized. For example, the fungal strain Sphaeropsidales strain 
F-24'707 is a producer of the antifungal compound cladospirone bisepoxide. When 
this strain was grown in a combination of different media and cultivation types, eight 
new spiro naphthalenes were isolated. There were previously only six known members 
of this class of compounds. The addition of inhibitors, such as tricyclazole, inhibited 
some pathways and therefore allowed the production of other compounds, such as two 
new spirobisnaphthalenes and a rare macrolide, mutolide (79). 

Many microorganisms do not readily express natural-product gene clusters when 
grown in the laboratory (75). We have to find the right physiological signal to stimu- 
late the molecular machinery. A systematic fermentation program should be conducted 
with "talented" strains and with representative strains of poorly known genera in order 
to maximize the number of compounds produced by each strain. Such an approach 
should include: 

• Cultures grown in shaken liquid vessels and on solid media. 

• Cultures grown with media of different composition 
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• Incubation at two or more temperatures. 

• Incubation at two or more shaker speeds. 

• Incubation for at least two different time periods. 

• Media with at least two pH levels. 

• Absorbents, enzyme inhibitors, elicitors, precursors, precursor analogs, and high concen- 
tration of salts should be added to the most productive fermentation media. 

After selection of one or two potent media, the influence of the other factors can be 
analyzed using an experimental design, such as fractionated factorial or Plackett-Bur- 
man design (80). 

3.4. Co-Cultivation 

Microbial communities also hold potential for the production of novel compounds. 
In nature, microorganisms do not exist alone; they are part of tiny ecosystems. There is 
expected to be diverse signaling and cross-feeding going on between organisms that 
will elicit production of novel compounds. Although the longstanding argument against 
this type of research has been that getting a stable mixed culture is almost impossible, 
it may provide a means for exploiting the true potential of the consortia as a whole. An 
example of using co-cultivation of two microorganisms producing related products has 
been suggested as a suitable way towards diversification of microbial structures (81). 

4. Genetically Modifying Strains to Produce "Unnatural Microbial 
Natural Products" 

4. 1. Expressing the Heterologous Metagenome in a Surrogate Host 

Culturable organisms provide only a finite pool of secondary metabolites (82). One 
approach to maximize the diversity of natural-product extract libraries has been to access 
the DNA directly from uncultured microorganisms. DNA can be isolated directly from 
an environmental sample, digested into large fragments with restriction enzymes, and 
cloned into an artificial vector (14). The vector is then transformed into a surrogate 
host (83). Environmental DNA libraries can be prepared with large fragments of DNA 
from a wide range of uncultivated bacteria within an environmental sample ( 36). This 
is described as screening the metagenome, the genomes of the total microbiota in an 
environmental sample (84). The recombinant approach thus obviates the need for cul- 
turing diverse microorganisms and provides a relatively unbiased sampling of the vast 
untapped genetic diversity present in various microenvironments. As an additional 
advantage, the genes encoding a product of interest are already isolated and can be 
analyzed using the tools of bioinformatics, thus providing a potential boost to the efforts 
of analytical chemists to identify the product. Furthermore, the possibility of regulating 
the expression of isolated environmental gene clusters or combining them with genes 
for other pathways to obtain new compounds could furnish a further advantage over 
traditional natural-product discovery methodologies (113). However, it must be noted 
that these biosynthetic and regulatory genes could be dormant in the host, and optimal 
induction conditions may be required for the production of novel natural products. 

Advances in DNA-sequencing and bioinformatics technologies now make it pos- 
sible to rapidly identify the clusters of genes that encode bioactive compounds and to 
make computer predictions of chemical structure based on gene sequence information 
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(85). A high-throughput genome-scanning method has recently been developed that 
allows discovery of metabolic loci, independent of expression. Genome sequence tags 
(GSTs) are genes involved in natural-product biosynthesis. These GSTs are used as 
probes to screen for the presence of these genes within a clonal library. Any clone that 
contains a GST can then be screened for novel natural-product gene clusters. More 
than 450 natural-product clusters have been identified in this manner (85). 

4.2. Gene Mixing (Combinatorial Biosynthesis) 

The traditional way to diversify unnatural microbial natural products is by random 
mutagenesis or by culturing microbes with nonnatural precursors. However, the dis- 
covery in prokaryotes that the genes for natural products are usually clustered, made it 
possible to clone an entire pathway into a vector (14). Many natural-product genes are 
modular and produce multifunctional enzymes. They have a high degree of plasticity. 
By interchanging and moving around genes within these clusters, hybrid enzymes can 
be produced that are capable of synthesizing an unlimited set of new molecules (82). 

The modular nature of many secondary metabolite genes provides an ideal system to 
genetically engineer formation of unnatural microbial natural products by incorporat- 
ing genes from different pathways. An example of such an approach involves the 
polyketide synthases (PKSs), which are large, multi-domain enzymes that produce 
polyketides including antibacterials, immunosuppressants, and cholesterol-lowering 
agents (86). PKSs are encoded by a cluster of continuous genes and have a linear modu- 
lar organization of similar catalytic domains that build and modify a polyketide back- 
bone (87). Microbial genes can be engineered to produce enzymes with novel catabolic 
activities (82). The cloning of biosynthetic pathway genes from Streptomyces allowed 
the production of novel compounds by mixing the antibiotic systems of different antibi- 
otic-producing strains (72). Novel compounds were produced by gene transfer between 
strains producing the isochromanquinone antibiotics actinorhodin, granaticin, and 
medermycin (88). This pioneering work has been developed by many others for the 
production of novel enzymes and unnatural natural products (83,89,90). These modu- 
lar PKS clusters have been manipulated through introduction of different loading 
domains that specify a branched chain or cyclic substrate and direct inactivation or 
insertion of individual catalytic domains to produce new enzymes (87). Combinatorial 
biosynthesis has also been used effectively to generate novel compounds and enzymes 
in type I and type II PKS systems. This type of approach can be applied to many other 
modular enzyme systems (91,92). 

Genetic engineering and pathway modification will undoubtedly be important in 
strain optimization in the future. These methods provide a targeted approach for con- 
struction of novel pathways and in turn the potential of novel natural products (93). 

5. Monitoring Diversity by Dereplicating Microbial Strains and Chemical 
Extracts 

The cost of screening microbial natural products is high, and false positives waste 
the limited resources available for isolation and structural characterization. In order to 
increase the output, it is important to spend more time upfront on the prescreening of 
strains and extracts. In order to focus on the extracts of most interest as quickly as 
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possible, and to avoid repeatedly isolating the same common natural products, efficient 
dereplication process are of the utmost importance. One may argue that high-through- 
put screening, in which thousands of extracts can be screened in little time, makes the 
use of duplicate cultures redundant. However, we must consider that screening a more 
diverse set of cultures will increase the diversity of the extract library and in turn increase 
the chance of finding a "hit." Dereplication of the extracts can be done either at the level 
of the microorganism or the chemical extracts. Database-linking of microbial genetic 
taxonomy with extract diversity is extremely important (Fig. 1). Cluster analysis of an 
unknown sample and comparison of its taxonomy and extract chromatography to inter- 
nal databases as well as published literature will provide valuable information to deter- 
mine whether an extract and/or activity is novel or not. 

5. 1. Characterization and Selection of Microbial Strains 

Ecopia BioSciences Inc. developed an automated genomics platform that predicts 
the chemical structures of natural products by reading the sequences of the gene clus- 
ters that direct their synthesis. By surveying the genome, all of the natural products that 
a microorganism can make are identified before fermentation studies begin, and the 
downstream production and purification strategies are specifically tailored to isolate 
likely new chemical entities (NCEs) and avoid the re-isolation of known compounds. 
The integration of new genomics technologies greatly increases the efficiency of dis- 
covery and makes it possible to build a robust pipeline of NCEs from a small collection 
of microorganisms, providing a new paradigm for natural-product discovery (85). They 
scan the genomes of selected microorganisms that were reported to produce known, 
structurally diverse natural products, to build a database of gene clusters covering the 
full range of natural-product chemical diversity without sequencing entire genomes. 
This enables a strategy to prescreen in mini-scale for one microbe of interest in many 
more media and growth conditions, and look only for a specific compound property, 
such as molecular weight, ultraviolet (UV) absorbance, and lipophilicity predicted by 
the specific gene clusters. Then scale-up technologies could be used to identify and 
purify the specific compounds. The knowledge of the biosynthetic pathway of the com- 
pounds will guide rational design or mutagenesis to improve their yields. 

Any library of microorganisms is likely to have a high number of duplicate strains. 
Although identical strains from the same site may be excluded on isolation, many 
strains of the same species may be collected over time from a wide range of collection 
sites. Although these strains may be useful later in strain optimization, they are redun- 
dant and costly in the initial screening phase and decrease the probability of finding 
novel compounds. Thus, it is important to dereplicate the culture collection. However, 
one must consider that strains of the same subspecies may produce different com- 
pounds. 

Many methods can be used to dereplicate cultures, but molecular techniques are 
especially well suited to this type of analysis. However, bacterial taxonomists have not 
yet reached a consensus for defining the fundamental criteria of biological diversity to 
the species (94). Prokaryotes exchange chunks of their genomes too frequently to make 
any meaningful species definition (21). An accurate definition of a fungal species is 
also problematic. Certainly, for all types of microbes, the basic unit is the "ecotype" 
(94), also called the "geovar" (20). This is the reason why dereplication of strains has 
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Fig. 1. Database linking microbial genetic taxonomy with extract diversity. The microbial 
genetic tree is generated based on ribosomal RNA sequence analysis. Chemical extract chro- 
matography is linked to the taxonomy data by a bio-informatics approach. 



to be considered only within populations isolated from a particular geographical/eco- 
logical region. A strain of Streptomyces hygroscopicus isolated from California is not 
necessarily identical to one isolated from China. Indeed, more than 200 secondary 
metabolites have been isolated from various Streptomyces hygroscopicus strains. 

When building a library of microorganisms for use in HTS, thousands of strains are 
isolated and have to be chosen and characterized. The first step is careful morphologi- 
cal observation. This allows the cultures to be separated into taxonomic groups. Fur- 
ther speciation can be done using molecular or biochemical methods. Biochemical 
culture-dependent techniques such as fatty acid analysis, pyrolysis mass spectrometry 
(95), and FT-IR analysis (96) were developed initially for clinical isolates and cannot 
be easily applied to environmental samples that require prolonged growth periods. 
Changes in the media, incubation temperature, and growth period can alter the profile 
of the organism, and hence results can be compared efficiently to one another only 
within one experiment. However, if the culture conditions can be standardized, the use 
of pyrolysis mass spectrometry analysis can reflect similar clustering of taxonomic 
groups as molecular methods (97). 

The morphogenic groups can be separated further using molecular methods such as 
restriction fragment length polymorphism (RFLP), which could differentiate strains to 
the subspecies level (98-100). Ribosomal genes, including the intragenic spacer regions, 
have been used routinely to differentiate both fungal and actinomycete strains. Thou- 
sands of sequences are available in GenBank and the Ribosomal Database Project that 
can be used to phylogenetically identify interesting organisms. The pitfalls of relying 
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on polymerase chain reaction (PCR)-based rRNA analysis as a measure of microbial 
diversity in environmental samples have been emphasized (101). Sequencing of other 
molecular markers, although costly, does however allow identification to the species 
level. When used in combination with RFLP, strains can be separated to the subspecies 
level (100). 

5.2. Characterization and Choice of Microbial Extracts 

Strains of the same species may generate different chemistry in the same media. The 
first physical characteristics of unknown natural products are determined during the 
chemical extraction and concentration steps. Solvent-based and acid-base partitioning 
experiments can help define hydrophobicity and types of functional groups of the natu- 
ral-product structures. 

The most effective selection method from the metabolic aspect is the chemical pro- 
file analysis by high-performance liquid chromatography (HPLC) and thin-layer chro- 
matography (TLC) data (102). Identical HPLC retention time or TLC R f values may 
not tell you whether two compounds are exactly the same, but different values defi- 
nitely indicate they are different. Micro-scale extraction procedures have been devel- 
oped (103) and can be automated. The first selection criterion is the metabolic creativity 
of strains, as the number of peaks revealed by HPLC with detectors of evaporative light 
scattering detection (ELSD), vs photodiode array (PDA), chemiluminescent nitrogen 
(CLND), and time-of-flight mass spectrometry (TOFMS) (104). Nielsen et al. pro- 
posed a method for identification and confirmation of chemical compound classifica- 
tion based on single- or multiple-wavelength chromatographic profiles (102). 
Chromatographic matrices from analysis of previously identified samples are used for 
generating reference chromatograms, and new samples are compared with all chro- 
matograms by calculating resemblance indices (111). In addition, the method allows 
identification of characteristic sample components by local similarity calculations: 

http://www.esainc.com/products/HPLC/Optical_Detectors/esa_ChromaChem.html 

Detection from ELSD is based on the universal ability of particles to cause photon 
scattering when they traverse the path of a polychromatic beam of light. The liquid 
effluent from HPLC is first nebulized, and the resultant aerosol mist containing the 
analyte particles is directed through a light beam. A signal is generated that is propor- 
tional to the mass present, and is independent of the presence or absence of chro- 
mophores, fluorophores, or electroactive groups. Since essentially every compound 
can be separated by HPLC or micro-HPLC and detected by ELSD, the ESA ChromaChem 
is equipment that every chromatography laboratory should have. It is a mass-sensitive 
device, which provides a response directly proportional to an analyte' s mass in the 
sample. The presence of functional groups or chromophores is not necessary for detec- 
tion. Relative amounts of compounds can be easily assessed by evaporative light-scat- 
tering technology. Any nonvolatile analyte can be detected, and gradient elution can be 
employed to optimize the separation. Aqueous as well as solvent-based mobile phases 
can be used to detect compounds that are not generally "seen" by other detection tech- 
niques. This detector can be used in conjunction with mass spectrometers to provide a 
complete analysis of the sample. The ChromaChem's unique nebulization system and 
temperature-controlled drift tube provides sensitivity, reliability, and reproducibility. 
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The unit's small footprint requires a minimum of bench space, allowing use under 
space-limited conditions. 

The basic methods to compare microbial extracts are HPLC-DAD (diode-array 
detection) and HPLC-MS. Researchers at Eli Lilly developed a rapid (about one 
sample per min) surrogate measure of microbial secondary metabolite production com- 
puted from the electrospray mass spectra of samples injected directly into a spectrom- 
eter (105,106). 

The development of a multi-channel mass-spectrometry interface has allowed analy- 
sis at high-throughput level. In most cases, LC-MS (liquid chromatography-mass spec- 
trometry) is the most sensitive method for obtaining dereplication information about a 
compound. A recent development is an eight-way fully automated parallel LC-MS- 
ELSD system for the analysis of natural products (107). LC-NMR (liquid chromatog- 
raphy nuclear magnetic resonance) should become operational in the near future, 
allowing the on-line identification of organic molecules (108,109). LC-NMR, although 
it has lower sensitivity than LC-MS, provides a powerful tool for rapid identification of 
known compounds and identification of structural classes of novel compounds. LC- 
NMR is especially useful in instances where the data from LC-MS are incomplete or 
do not allow confident identification of the active component of a sample. 

For strains and chemical tracking, an in-house database has to be built to integrate 
with commercially available ones such as Antibase database (Wiley Publishers, 2003) 
or the Dictionary of Natural Compounds (Chapman & Hall, London). There is no single 
technique that gives 100% confidence to differentiate any two natural-product chemi- 
cal profiles, but computer-enhanced structural determination methods could integrate 
various spectral data and raise confidence. 

5.3. Chemical Dereplication to Prevent Repeated Discovery 

Key elements in the success of a natural-product discovery program are quick iden- 
tification of bioactive compounds, early elimination of known or unwanted metabo- 
lites, and rapid determination of the structure of novel compounds. Dereplication 
strategies use analytical techniques and database searching to determine the identity of 
an active compound at an early stage. Dramatic improvements have been achieved 
during the past years mainly due to the impact of combinatorial chemistry. Natural- 
product chemistry has to take advantages of these recent developments. The final sepa- 
ration-purification procedures are not discussed here since the procedures are complex 
and depend on the characteristics of the targeted compounds; this is beyond the scope 
of this chapter. 

In the search for new microbial natural products, we have to consider the frequency 
of re-isolation of already-described metabolites from microbial cultures. Rough esti- 
mates suggest that we have isolated only a minute fraction of the compounds that exist 
in nature. Full identification of a natural product should be done only after partial puri- 
fication to determine whether this type of compound is already known or has potential 
as a useful drug. 

Natural-product samples need to be normalized by concentration or weight. Com- 
mon "interfering" groups of compounds such as detergent-like or toxic compounds 
should be removed. Samples should be grouped into related chemical classes and then 
prioritized for further fractionation. The hit profile coupled with genetic and morpho- 
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logical characterization of the strains will build an increasing level of confidence in a 
putative structure. 

The first physical characteristics of unknown natural products are determined dur- 
ing the chemical extraction and concentration steps. Partitioning experiments with sol- 
vents exhibiting a range of polarities and pH values will shed light on hydrophobicity 
and charged functional groups of the natural products. 

If the natural-product extract contains a reported commercially available compound, 
the sample and the reference standard should be co-injected for TLC and HPLC. Iden- 
tical retention times (HPLC) or Rf values (TLC) may not tell you that they are exactly 
the same, but different values definitely indicate they are different. If a standard is not 
available, the hypothesis could be tested by employing a physical test such as MS, 
looking for ions of the same approximate molecular weight. The chromatographic be- 
havior could also reveal the nature of the compounds of interest, such as logP, surface 
area, and dipole moment. 

Intelligent screening approaches towards microbial natural products are also required. 
One of the major limiting factors in the drug-discovery industry is that pharmaceuticals 
have been traditionally designed to target individual factors in a disease system, but dis- 
eases are complex in nature and vulnerable at multiple attack points. Therefore, a system- 
atic novel synergistic drug-screening approach based on a multifactorial principle is 
urgently needed. Many drugs could be more effective at a reduced dosage if low dosages 
of other synergistic compounds are introduced simultaneously. Many marketed tradi- 
tional medicines have demonstrated great efficacy and safety profiles in their long his- 
tory. However, when efforts were made to purify a single molecule, the activity often was 
lost. SynerZ Pharmaceuticals Inc. has developed a drug-discovery approach consonant 
with the systems biology framework, and complementary to the target-based approach. 
Synergistic co-drugs from natural products will enable existing drugs to be more effec- 
tive and contribute to our better understanding of multiple pathways to cure disease. 

Ketoconazole is commonly used to treat Candida infections. However, at clinical 
doses, ketoconazole is associated with important toxic side effects, including hepatitis. 
In addition, resistant strains often emerge during long-term or prophylactic treatment 
as a result of the necessarily high concentrations of drug required. The concentration of 
ketoconazole alone at 0.01 ug/mL gave only about 20% inhibition of growth (Fig. 2). 
When ketoconazole was tested at 1 ug/mL, it gave 90% inhibition of growth. However, 
the combination of ketoconazole at 0.01X with F0101604 achieved about 95% inhibi- 
tion (better than 100-fold the ketoconazole amount used here) and the mode of action 
was cidal, showing a synergistic effect of the two components rather than an additive 
effect. The natural product SNZ101 purified from F0101604 not only improved the 
efficacy of a much reduced dosage of ketoconazole, but also broadened its spectrum on 
drug-resistant strains and reduced its side effects (data not shown). It is clear that natu- 
ral product F0101604 would be disregarded in conventional screening technology for 
antifungal lead discovery, because by itself it failed to show any growth inhibition on 
fungal pathogen Candida parapsilosis ATCC 22019. 

6. Closing Remarks 

One prerequisite to natural-product discovery that remains paramount is the range 
and novelty of molecular diversity. Currently, natural products are going through a 
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Fig. 2. Synergistic effect of F0101604 with a low dosage of ketoconazole (X = 1 |jg/mL). 
Equal amounts (10 4 /mL) of Candida parapsilosis ATCC 22019 are cultured in Mueller- Hinton 
(MH) broth with Alamar Blue dye in the presence and absence of a subclinical concentration of 
ketoconazole. Samples are treated as labeled in duplicate. Fluorescence reading after overnight 
incubation at 35°C in a moist chamber is measured at Ex 544 nm and Em 590 nm, and con- 
verted to percentage of growth inhibition. 



phase of reduced interest in the drug-discovery field (112). However, new develop- 
ments may reverse this negative perception. 

The systematic exploitation of selected ecosystems, combined with the develop- 
ment of new techniques and media for isolation of novel microorganisms, will allow the 
collection of representative strains from a large part of the micropopulation. This maxi- 
mized biodiversity will deliver chemical diversity for an ecosystem. 

The direct expression of environmental DNA in heterologous surrogate hosts is 
progressing. There is a need for rapid and sensitive detection and characterization of new 
metabolites as well as their gene clusters. 

Physiological manipulation should be based on experimental design and mea- 
surement of secondary metabolism. Co-cultures will give novel insight into secondary 
metabolism and require the development of new vessels for stable mixed-culture fer- 
mentation. 

Gene mixing coupled with the genetic engineering power of PKS , for example, will 
allow the generation of hybrid or "unnatural microbial natural products." 

Total synthesis of natural products with interesting biological activities is paving 
the way for preparation of new and improved analogs. Combinatorial chemistry permits 
the selection of the best drug from a large number of candidates. Beyond synthesis and 
evaluation of organic molecules, a number of new bioorganic methods are emerging on 
the horizon. 

In natural-product chemistry, the rapid and accurate differentiation of chemical 
compound profiles is based on on-line measurement by LC-ELSD, DAD, MS, and 
NMR. Automated comparisons of the metabolite profiles of microorganisms can be used 
as a valuable method for building libraries of natural products for drug discovery ( 1 14). 
Today, more than 30,000 diseases are clinically described, but less than one-third of 
these can be treated symptomatically, and only a few can be cured (110). New chemi- 
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cal entities as therapeutic agents are urgently desired. Natural products can play a main 
role in drug discovery. New strategies for natural products-based drug discovery will 
increase chemical diversity and reduce redundancy (115). Maximizing the discovery 
of new compounds and minimizing the re-evaluation of already known natural prod- 
ucts will be crucial. 
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Summary 

Despite the fact that natural products have historically been a prolific source of new com- 
pounds, pharmaceutical drug discovery programs have moved away from natural products in favor 
of synthetic approaches. However, the abundance of synthetic compounds with similar functional 
groups and, therefore, limited chemical diversity has renewed interest in nature as a good resource for 
finding new ideas to be applied to the design of the next generation of drugs. 

One of the main issues for drug discovery programs based on microbial natural products is how 
to obtain the maximum potential from microbial strains in terms of chemical diversity of the metabo- 
lites produced. Several approaches are now available for enhancing the production and diversity of 
secondary metabolites from wild-type microorganisms. Automated comparisons of the metabolite 
profiles of microorganisms can be used as a valuable method for building libraries of natural products 
for drug discovery. Specific computer analyses of high-performance liquid chromatography chro- 
matograms from organic extracts of fermented microorganisms can be used as a tool for increasing 
chemical diversity of collections, media improvement, evaluation of natural products libraries, and 
even determination of taxonomic correlations. Examples of what can be done using some of the new 
generation of software tools to compare profiles of secondary metabolites include the evaluation of 
extraction solvents and fermentation formats for the design of natural-product collections, and even 
the determination of relationships among strains from different origins. 

Key Words: HPLC; automation; metabolite profiles; chemical diversity; extraction solvent; 
fermentation format; fermentation media; actinomycetes; fungi; taxonomy. 

1. Learning from Experiences 

In recent years, pharmaceutical companies have focused their efforts on finding new 
leads from combinatorial synthetic libraries. The large number of compounds that can 
be generated and tested is enticing, and there is a perceived incompatibility of natural 
products with some of the refined detection techniques commonly used in modern high- 
throughput screening (1), However, current evidence shows that combining small ele- 
ments does not necessarily lead to new chemical entities or real diversity (2). Since 
Nature has been creating new chemical structures in a very efficient manner over mil- 
lions of years, the large variety of bioactive compounds obtained from microorganisms 
actually makes them the most specialized synthetic laboratories in practice (3). 
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We should therefore learn from our experiences and give natural products another 
chance. Why not approach natural-products discovery from a renewed and modern 
perspective? For example, a variety of new and modern techniques have been devel- 
oped, including recent initiatives that use natural products as the building blocks for 
creating combinatorial libraries, and even using purified natural products for creating 
such libraries (4,5). With the goal of benefiting from such new approaches, we ana- 
lyzed and improved the capability of microorganisms to produce secondary metabo- 
lites, incorporating lessons learned from past experiences in drug discovery. The efforts 
of our laboratory resulted in the design of a high-quality natural -products library (HQ- 
NPL), which we believe offers some clear advantages over traditional natural-products 
libraries. 

2. Development of the HPLC Studio Tool 

The preparation of natural-products samples for drug discovery usually begins with 
the fermentation of microorganisms. A historically accepted approach to increasing the 
chances of finding new drugs has been to maximize chemical diversity by broadening 
the biodiversity of the microbes, both geographically and taxonomically (6). In addi- 
tion, it is well established that the production of secondary metabolites by microorgan- 
isms is also highly dependent on the components of the fermentation media (7,8). 
Theoretically, obtaining the maximum yield of secondary metabolites from a strain 
would entail growing each microorganism in the maximum possible number of media. 
The combination of source diversity multiplied by diversity of the fermentation condi- 
tions should produce the largest number of secondary metabolites per strain. However, 
increasing the number of sources, microorganisms, and the number of fermentations 
per microorganism and/or the number of extracts per fermentation are all quite labor 
intensive, and may be redundant. So the advantages gained thereby need to be balanced 
against practical constraints. 

Chemometrics, i.e., automated measurement of the chemical diversity of extracts 
from fermentation broths, is a key tool with which to balance research and investment. 
Automated measurement has been accomplished in our laboratory by creating a soft- 
ware tool that, among other features, selects from a large array of small-scale fermen- 
tation conditions those few conditions that provide maximum chemical diversity and 
quantity of material (9,10). The media selected from these analyses can be used to 
grow each strain in large scale to generate material for creating a HQ-NPL. 

In practice, the automated selection process includes four parts: fermentation of each 
microorganism in a battery of small-scale production media, extraction of those fer- 
mentations, chemical analyses, and final automated treatment of the data to select the 
most appropriate components (see Fig. 1). The detailed process will be described in the 
following subheadings. After this chemometric analysis and selection, each microor- 
ganism is grown on a large scale to meet screening requirements. We can thus take 
advantage of the ability of microorganisms to produce different secondary metabolites 
depending on the nature of the fermentation medium. The selection of those media in 
which broader chemical diversity is obtained is achieved through computer analyses of 
the high-performance liquid chromatography (HPLC) chromatograms of extracts, ulti- 
mately leading to an increase in the quality of a natural-products library for drug dis- 
covery screening. 
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Fig. 1. General overview of the process. 
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2. 1. Prescreening a Small-Scale Microbial Fermentation 

Fungi and actinomycetes were isolated from soil and plant samples collected in Costa 
Rica, Mexico, Panama, South Africa, and Spain. Seed cultures were incubated at 220 
rpm, 28°C and 70% relative humidity for 72 h (11). A portion (0.5 mL) of each inocu- 
lum was inoculated into ten glass tubes (25 x 150 mm) containing 10 mL of different 
production media. The over 3000 fermentation tubes were incubated in racks at an 
inclination of 35° in the same conditions for 21 d for fungal strains and 7 d for actino- 
mycetes, before harvesting. Initially, we decided to obtain a reasonable number of 
small-volume fermentation media per microorganism type, either fungal or actino- 
mycete. The composition of the production media derived from prior experience with 
different nutritional sources (10,12). The range of media compositions included as 
many different carbon and nitrogen sources as possible to amplify the scope of second- 
ary metabolites produced. Glucose, lactose, fructose, maltose, dextrin, glycerol, soluble 
starch, and the complex sources corn meal, wheat meal, millet meal, and cane molasses 
were the carbon sources. Yeast extract, primary yeast, peptone, ardamine pH, NZ Soy 
BL, pharmamedia, distiller-soluble peptonized milk, and meat extract were the nitro- 
gen sources selected (see ref. 10 for media details). Extraction of the fermentations was 
performed with 7 mL organic solvents (see Subheading 3.1. for details), stirring for 1 
h at room temperature. Only the upper half of the organic layers was processed to avoid 
interface interference during automation procedures. After drying under nitrogen, the 
residues were dissolved in 1 mL HPLC-grade MeOH and filtered through 0.2-um mem- 
branes to obtain a final 7X concentration (see Fig. 2). 

2.2. Select the Best Sets of Fermentation Conditions 

An automated procedure was developed to analyze a large number of samples and 
select the best sets of fermentation conditions. The HPLC vials were bar coded (Fig. 2) 
by querying an internal database to reduce manipulation errors. Each label referred to a 
database number, which identified the microorganism, the fermentation medium, and 
the extraction procedure. This allowed the possibility of comparing different prepara- 
tion methods such as extraction with methanol, acetone, or methylethylketone (MEK). 
Each sequence of vials included extracts of unfermented media with which to establish 
a control baseline. These samples, as well as blank methanol injections, were also used 
for quality control and to normalize between different batches of microorganisms. 

Sets included ten unfermented media, eight strains fermented in each of those fer- 
mentation media (80 vials), and methanol blank samples (chemical blanks) for each 
microorganism as a control for the process. Each one was identified by an HPLC batch 
file, so that the software could identify how many samples were injected, the database 
ID of each sample, blank control samples, corresponding blank media samples, and 
where in the hard drive of the computer the analysis data files generated by the HPLC 
ChemStation software were stored (13). Moreover, the software can link the fermenta- 
tion extracts with their corresponding blank media by querying databases, and include 
the possibility of doing it manually for special nonlabeled samples if necessary. The 
software also allows backups of the chromatograms in a portable computer, storing the 
format for later processing if needed (9). 
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Fig. 2. Small-volume sample management procedures. 



2.3. Chemical Analyses 

Analyses relied on HPLC reverse-phase gradient chromatography, as it provides 
complex information about chemical diversity for comparing organic extracts. The 
detector system was a diode-array HPLC detector, recording simultaneously at 210 nm 
and 280 nm (14). Diode-array HPLC gradient characterizations of the reconstituted 
extracts were performed on a ZORBAX Rx-C8 4.6 mm x 250 mm column. A 10 to 
100% gradient of acetonitrile in water with a flow-rate gradient of 0.9 to 1.2 mL/min 
was programmed on an 1 100 HP Agilent ChemStation, using a constant temperature of 
20°C during each 20-min analysis. Trifluoracetic acid (TFA, 0.01%) was added for pH 
control (see ref. 10 for more details). 

The chemical data obtained for each organic extract, filed in an individual folder by 
the ChemStation software, consisted of a graph at both 210 and 280 nm vs time (in 
windows metafile format), and two tables of detected peaks at each of these wave- 
lengths in MS-Excel file format. The Agilent ChemStation software determined the 
presence of peaks according to standard parameters such as the line slope, described in 
detail in the reference manual of the Agilent 1 100 series of HPLC spectrometers (13). 

In the absence of a commercially available program with which to compare more 
than two complex HPLC chromatograms, we initially relied on simple visual examina- 
tion to devise the setup of the new software tool. 
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2.4. HPLC Studio Software Generalities 

An application created especially for the project was used for data processing. The 
software allowed the combination of individual chromatograms and their comparison 
with great versatility (9). For each strain, the application selected the media showing 
both the highest chemical diversity (as measured by the number of peaks) and the high- 
est uniqueness (as measured by the presence of peaks not found in other chromato- 
grams) (see the automated prioritization of the 10 media where the encoded strain 
F098,432 was fermented, in Fig. 3). This approach provided an automated way to com- 
bine data from all the fermentation conditions obtained for a single strain and thereby 
estimate its maximum chemical diversity (see Fig. 3, column 12 for cumulative data on 
chemical diversity). 

The chromatograms obtained for a particular microorganism could be processed 
by the user by selecting the ID numbers from some or all of the fermentation condi- 
tions in the injection sequence. In that way, different conditions of the same fermen- 
tation medium, and any other data generated by the ChemStation software, could be 
analyzed. 

Assuming that each peak detected at 210 nm represented at least one metabolite, 
other data processes were needed before the comparisons of different fermentation 
media could be carried out. Initially, chromatograms from the fermented microorgan- 
ism were compared with the ones from the nonfermented media in order to remove 
background peaks caused by media components. Some degree of variability occurred 
in the HPLC analysis of complex extracts due to several factors, such as the composi- 
tion of the mixture, the quantity injected, the presence of compounds with very close 
retention times, and the detector resolution. A resolution time value, determined by 
visual comparison of different co-injections of mixtures of highly complex microbial 
extracts, was evaluated for those as + 0.03 min (1.5 times the resolution of an Agilent 
HPLC [0.02 min] and 0.065 min in absolute value) (10). 

A peak detection threshold was chosen to avoid small peaks and baseline fluctuations. 
This simplification reduced the data of the chromatogram template based on an absolute 
area value or a percentage of the total area of the chromatogram. A relative 2% cut-off 
value of the total area was used for the strains studied. Early-eluting solvent fronts and 
late-eluting re-equilibration peaks were likewise eliminated from consideration. 

The data processing creates a chromatogram for each strain, a so-called chromato- 
gram template, obtained by sequentially combining the chromatograms for each strain. 
For example, two chromatograms are combined, removing any two peaks with a differ- 
ence in retention times smaller than the given resolution time (assumed identical). Fol- 
lowing that, the software sequentially adds in the other chromatograms. We found that 
the order in which the chromatograms were added into the template did not affect the 
final result. All individual data were kept, and average retention times were calculated 
each time with previously accumulated data. It should be noted that all the data needed 
for characterizing the diversity in each fermentation medium were present in the chro- 
matogram template: peaks present in only one medium appeared in the template as 
they appeared in the original chromatogram (with retention times, areas, and extract of 
origin), whereas peaks present in several media appeared in the template with their 
mean retention times, all individual areas, and extracts of origin. Thus all peaks present 
in the chromatogram of a given medium could be obtained from the chromatogram 
template. 
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Fig. 3. High-performance liquid chromatography (HPLC) Studio report obtained for the encoded strain F-098432. 
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3. HPLC Studio Applications 

The processed data yielded information concerning the relative diversity and num- 
ber of metabolites produced in each of the fermentations. These chemometric measure- 
ments led us to choose which samples should be included in an HQ-NPL. 

3.1. Selecting the Best Extraction Solvent 

One of the first decisions in the design of a method for creating a library of natural 
products is the selection of the extraction solvent. The most commonly used solvents in 
our laboratory are methanol, acetone, and methyl-ethyl-ketone (MEK) (15). We decided 
to compare the differences among these three solvents in a study using 69 fungal and 77 
actinomycete strains. Fermentations (50 mL) in AD 2 M 2 medium for fungi and DNPM 
medium for actinomycetes were aliquoted (10 mL) and extracted in small volumes 
with methanol (10 mL), acetone (10 mL), or MEK (14 mL) (see ref. 10 for media 
components). 

Just by counting the number of peaks between 4 and 19 min, the results indicated 
that there were no significant differences between acetone and methanol, although 
acetone yielded a slightly higher number of metabolites per isolate. Interestingly, 
MEK was ultimately selected as the solvent producing the most complex extracts; in 
fact it extracted compounds that were extracted by acetone or methanol as well as other 
compounds with less polarity (Fig. 4). The HPLC Studio application was then used to 
prioritize the extraction solvents for each strain, based simply on number of nonredundant 
compounds extracted (see Fig. 5). MEK was the solvent of choice (greater number of 
nonredundant peaks extracted) in about two-thirds of the cases, followed by acetone, 
which was the best in just 20% of the cases. 

With unknown mixtures, it is impossible to quantitate extracted compounds in an 
absolute sense; nevertheless, we used total ultraviolet (UV) absorbance (intensity) to 
estimate the extraction efficacy for each of the solvents. By this measure, MEK was the 
least effective, while acetone performed well, particularly with actinomycetes (see 
Fig. 6). If all three solvents had extracted the same amount of material, they would 
each produce a balanced value of 33.3% of total area counts. However, MEK values 
were about 22% for fungi and 15.6% for actinomycetes. This is not surprising, because 
acetone and methanol are miscible with water, whereas an initial solid/liquid extrac- 
tion and subsequent liquid/liquid partition occurs for MEK (inmiscible with water). 
However, in a qualitative sense, MEK reduced the abundance of polar chemicals and 
enriched the extracts with those of intermediate polarity. Given that the primary factor 
to consider when creating an HQ-NPL is chemical diversity in samples, MEK is the 
preferred solvent, even though it extracts a smaller amount of material. A quantitative 
loss can be adjusted later by increasing the fermentation volume by 15%, or by concen- 
trating samples. 

3.2. Selecting the Best Fermentation Format 

We also compared fermentations performed in different-sized vessels. Four systems 
were examined: the classic 50-mL flasks, 10-mL colony tubes, 10-mL EPA vials, and 
2.7-mL wells in 24-well plates. Twenty-two organisms (10 fungi in DNPM and 12 
actinomycetes in AD 2 M 2 medium) were fermented in these four formats and extracted 
with MEK. The results (Fig. 7) indicated that all fermentation formats performed 
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Fig. 4. Average number of metabolites per strain determined for 10 fungal and 12 actino- 
mycete strains as a function of extraction solvent. Data were based on comparing chemical 
diversity observed by high-performance liquid chromatography analysis at 210 nm. 
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Fig. 5. Prioritization of the organic extraction solvents determined with the high-perfor- 
mance liquid chromatography (HPLC) Studio tool for 69 fungal strains and 77 actinomycetes. 
Data were based on prioritizing the chemical diversity observed by HPLC analysis at 210 nm. 



equally well as measured by the average number of compounds produced. However, it 
is possible to obtain extracts with the same number of secondary metabolites but with 
different compositions. 

In order to determine similarity coefficients of extracts based solely on retention 
times, the commercial software BioNumerics™ was used for cluster analysis and rep- 
resentation of the dendrograms. HPLC profiles were compared by neighbor-joining 
analysis using the Dice similarity coefficient. The results indicated that, in a significant 
number of cases, all or almost all the extracts from the same strain clustered within the 
same group (results not shown). 

As a measure of quality control, once the computer selected the three best media, 
some isolates were re-fermented on a large scale, and the new extracts were compared 
with the original ones (see Fig. 8). Identical chromatograms were observed in 95% of 
the cases. The remainder showed only minor differences between small- and medium- 
scale formats. 
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Fig. 6. Relative Areas determined with the high-performance liquid chromatography (HPLC) 
Studio tool for 69 fungal strains and 77 actinomycetes extracts vs the organic extraction sol- 
vent. Data were based on prioritizing the area observed by 210 nm HPLC traces. 




EPA 



Colony 



Flask 



24 well 



Fig. 7. Average number of metabolites per strain determined for four different fermentation 
formats obtained for 10 fungal strains and 12 actinomycete extracts. Data were based on com- 
paring chemical diversity pairs observed by high-performance liquid chromatography analysis 
at 210 nm. 



Having confirmed reproducibility, we selected the EPA fermentation vials for the 
fermentation format. Based on the easy extraction automation of the EPA vials, they 
appeared to be the best choice overall for small-scale growth of microbial strains. 

3.3. Determination of the Relative Chemical Diversity for a Fermentation 
Condition 

As mentioned earlier, HPLC analysis can be used to characterize the relative chemi- 
cal diversity present in an organic extract obtained from microbial fermentations (14). 
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Fig. 8. Comparison between small- and medium-volume high-performance liquid chroma- 
tography (HPLC) chromatograms for the three extracts selected by the HPLC Studio software 
for the F-098432 strain. 



The diversity characterization begins by having the HPLC Studio software select the 
medium with the highest number of peaks. The number of peaks relative to the number 
of peaks in the chromatogram template indicates its diversity as a percentage. Peaks 
that are present in the best fermentation medium are then virtually removed from the 
template prior to the second step. Peaks in each of the other media are counted only if 
present in the partially depleted template, allowing the second-best medium (that which 
contributes to most of the remaining peaks) to be determined. This process is repeated 
until all the media are ranked, establishing a diversity characterization value for each 
extract (see a practical example in ref. 10). 
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3.4. Ensuring a Relative Extracted Quantity for a Fermentation 
Condition 

The overall objective was to select not only the fermentation medium resulting in 
the highest chemical diversity, but also to ensure that sufficient quantity of material is 
generated for an HQ-NPL. Thus the HPLC UV area was used in addition to the number 
of peaks in creating the final prioritization list. The user can determine the relative 
importance of this contribution for each comparison by deliberately selecting a to 
100 percentage. Each extract received a final ranking by combining both quantity and 
chemical diversity. 

Introducing a quantity parameter affected the order of prioritization that was obtained 
solely based on characterization of diversity. As the accumulated diversity percentage is 
also dependent on the order of accumulation, a recalculation of the diversity character- 
ization percentages for the final order was needed when areas were incorporated in the 
prioritization. That allowed us to determine the real percentage of the total of diversity 
that was attained by performing the final prioritization, both for diversity and quantity 
of accumulated values. 

3.5. Selecting the Best Overall Fermentation Conditions for a Single 
Strain 

The computer report generated (see Fig. 3) can then be used as a guide for determin- 
ing how many extracts would provide enough value to be included in an HQ-NPL. In 
that way, we could measure whether the inclusion of any given medium would dra- 
matically increase the total diversity. It is also possible to determine the number of 
extracts needed to reach a predetermined level of relative diversity and/or quantity. 

When ranking by the quantity parameter alone, six medium-scale fermentations were 
needed per strain to reach a 75% value for diversity. When only the diversity parameter 
was used, the number of medium-scale fermentations per strain decreased to four. On 
the other hand, the accumulated areas (quantity values) showed that the first three to 
four media prioritized covered half of the cumulative of the 10 fermentation conditions 
(10). 

With a prioritization balance set at diversity /quantity = 75/25, the difference achieved 
between three and four media was not significant enough to justify four medium-scale 
fermentations. Including more kinds of microorganisms in the medium-scale production 
would be a preferred option to amplify chemical diversity (10). Using only the first 
three media prioritized by the HPLC Studio software in medium-scale for additional 
microorganisms would be the most balanced choice. Such a selection would allow the 
creation of an HQ-NPL with more strains instead of preparing more medium-scale 
fermentations per microorganism. 

3.6. Selecting the Best Overall Fermentation Conditions 
for a Set of Strains 

Chemometric data for each fermentation medium could also be used to determine its 
usefulness for a specific set of microorganisms. With small populations of strains, the 
fermentation conditions that resulted in the greatest diversity and the greatest quantity 
for each strain did not overlap in most of the cases, and none of the media stood out in 
terms of their selection frequency (10). 
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Figure 9A,B shows, respectively, the selection frequencies for each of the 11 fer- 
mentation media when used with a representative subset of 50 actinomycetes, and pri- 
oritized either by the presence or absence of secondary metabolites (diversity) or by 
their accumulated areas (quantity). Divergence from the 45° line indicates its differ- 
ence from the average. The most productive media are above the line, the poorest ones, 
below. From the graph, we can infer that for this set of strains, the best media in terms 
of diversity were GOT, FR23, and MPG, whereas in terms of quantity the best media 
were MH, MPG, and FR23. On the other hand, the poorest media in terms of diversity 
were PV8 and RAM2, and clearly PV8 in terms of quantity (10). Such comparisons of 
similar microorganisms can be used to select optimal fermentation conditions and estab- 
lish a good panel of fermentation media for the preparation of an HQ-NPL. 

In terms of taxonomy, a large part of our fermentation program was devoted to acti- 
nomycetes. Therefore, we asked whether genera-specific responses to fermentations 
could be recognized (16). Several fermentation media were more frequently selected 
for each family of strains, throughout the range of all fermentation conditions. Only the 
families of Micromonosporaceae, Nocardiaceae, Pseudonocardiaceae, and Strepto- 
mycetaceae were represented with statistically relevant numbers of strains. None of the 
media selection frequencies observed for these four families reached 75%. The low 
values indicated again that there was not a preferred medium for most of the strains of 
these families, and confirmed the necessity of performing the prioritization study for 
each strain or family of strains for maximum exploitation of metabolite production (10). 

3. 7. Comparing Secondary Metabolites Profiles vs Taxonomy 

Historically, it has been widely accepted that increasing the chances of finding new 
drugs relies on the phylogenetic diversity and exploitation of different environments 
and geographic areas. Relying solely on taxonomy and strain origin is no guarantee of 
uniqueness, however. It is well known that unrelated microorganisms can produce the 
same compounds, and that closely related strains do not always produce a similar pat- 
tern of secondary metabolites (17). 

Microbial isolation and selection has traditionally relied on the expertise of the 
microbiologist to recognize, on the basis of morphological and genetic characteris- 
tics, the uniqueness of new strains obtained from the environment. In the last four 
decades, thousands of strains have been tested in industrial screening programs, and 
scientists inevitably test many repeats. According to the new philosophy of natural- 
products screening, the prioritization of strains to be included in an HQ-NPL is a key 
factor for the success of a screening effort. A number of approaches have been adopted 
to avoid screening duplicates: standardized chemotaxonomical analyses, molecular 
fingerprinting, DNA sequencing of marker regions, and metabolic profiling (18,19). 
According to our methodologies, by combining the metabolites of a given strain in a 
large variety of fermentation media, virtual chromatograms obtained with the HPLC 
Studio tool give scientists a good representation of its metabolic potential. 

Figure 10 shows the average number of secondary metabolites (in terms of different 
retention times) obtained for 20 representative actinomycete strains, classified accord- 
ing to their country of origin. Strains from different geographical origins behaved simi- 
larly. The degree of overlap among secondary metabolites produced by several strains 
was unpredictable. The number of different compounds obtained by combining all the 
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Fig. 9. High-performance liquid chromatography Studio software prioritization for the fermentation media in a 
representative subset of 50 actinomycete strains vs software prioritization conditions of chemical diversity (A) or 
quantity (B). The selection frequency indicates, as a percentage, the number of strains for which a fermentation condi- 
tion was prioritized with respect to the total number of strains tested. 
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Fig. 10. Average number of different metabolites obtained from 20 random actinomycete 
strains from each country of origin. 



strains of a region was determined in order to evaluate the overall metabolic potential 
diversity for each geographic location. When a subset of 20 randomly selected strains 
from each country were added, no significant differences in total number of metabo- 
lites per strain were observed among countries (approx 500 different secondary metabo- 
lites were detected in each case by the HPLC Studio tool). BioNumerics™ software was 
also used for cluster analysis of the HPLC chromatograms and graphical representation 
of dendrograms. For these phenetic analyses, HPLC data matrices generated with the 
HPLC Studio software were compared with BioNumerics™ in a pairwise manner using 
the Dice similarity coefficient and UPGMA (20). 

Three groups of 20 strains from three different countries (Mexico, Spain, and South 
Africa) were selected to see whether we could recognize correlations between taxo- 
nomic affiliation and metabolic profiles (Fig. 11A-C). Dendrograms generated after 
cluster analyses showed that, in general, strains belonging to the same genus clustered 
together with a high percentage of similarity, indicating that taxonomically related 
strains may have some degree of similarity in their metabolic pathways. However, in 
some cases, certain strains were unique in their metabolic profile, as they failed to 
cluster with the other representatives of the same taxon. That was the case, for instance, 
for the Saccharothrix strain F095473, isolated from a Mexican soil (Fig. 11A), or for 
four Actinoplanes isolates from South Africa that clustered separately with the other 
Actinoplanes (Fig. 11C). 

3.8. Comparing Secondary Metabolite Profiles vs Geographical Origin 

We also wanted to examine the metabolic diversity of microorganisms isolated from 
different countries. To address the question of whether increasing the geographical 
diversity of the strains results in a parallel increase of the overall chemical diversity, 
we performed a cluster analysis of the HPLC chromatograms produced by strains of 
Streptomyces and Nocardioform isolated from different countries (Fig. 12A, B, respec- 
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Fig. 11. Dendrograms showing the relationships among 20 actinomycete strains from Mexico, Spain, and South Africa, based on high-perfor- 
mance liquid chromatography profiles (UPGMA, Dice >50%Mean). The taxonomy strains at the genus level are indicated. 
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Fig. 12. Dendrograms showing the relationships between some Streptomyces and Nocardioform 
isolates from Panama, Costa Rica, and Spain, on the basis of their high-performance liquid chroma- 
tography profiles (UPGMA, Dice >50%Mean). 



tively). We observed that several cultures of the same taxon clustered together with a 
high percentage of similarity, suggesting that identical or similar profiles were pro- 
duced by these strains, independently of their origin. This result confirmed the well- 
known fact that many types of strains are ubiquitous, especially within the genus 
Streptomyces, and can be found in very different environments. There are also many 
similar secondary metabolite pathways present in phylogenetically unrelated strains 
(1 7). However, it was observed that not all the strains were found within a cluster at the 
established cut-off point, and some strains of a given taxon also showed unique meta- 
bolic profiles. 

4. Conclusions 

This study shows that the number and quantity of secondary metabolites obtained 
from microorganisms is measurable, and that the number of medium-scale fermenta- 
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tion conditions needed to encompass suitable chemical diversity can be estimated. The 
development of a new computational tool, the HPLC Studio software, gave us a valu- 
able instrument with which to attempt the rational design of a collection of natural 
products, allowing us to focus on and direct the efforts needed to reach a certain thresh- 
old of quantity and chemical diversity in a natural-products library. 

Likewise, we were able to set a numerical value on the ability of newly isolated 
strains to produce a range of different secondary metabolites. Some limited correla- 
tions were observed between taxonomy and metabolite production. Phylogenetically 
related strains were grouped into clusters according to their metabolite production, 
regardless of their country of origin. 

All these data were treated by computer-assisted statistical software. Integrating such 
a tool in the design of a library of natural products derived from microorganisms pro- 
vided us with additional criteria for the analysis of the metabolites obtained, and hence 
gave us rational criteria for the design of high-quality natural-products libraries with 
increased possibilities for finding new biologically active and pharmacologically rel- 
evant compounds. 
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Summary 

Kosan Biosciences was founded on the principle that the chemical diversity of microbial metabo- 
lites, which for decades have been a rich source of natural-product drug leads and therapeutically 
important drugs, can be increased by altering the function of the genes and enzymes that govern the 
production of these metabolites. In particular, the predictable relationship between the structure and 
function of the modular type of microbial polyketide synthases has enabled genetic manipulation 
("engineering") of the producing organism for production of novel forms of various classes of natu- 
rally occurring compounds, such as macrolide antibiotics (erythromycins and FK520) and certain 
antitumor agents (epothilone D and the geldanamycins). This resembles the approach used by medici- 
nal chemists who synthesize analogs and derivatives of lead compounds in an attempt to improve upon 
existing drugs or find new ones. Expression of the native or engineered polyketide synthase genes, 
as well as others that govern metabolite formation, in heterologous hosts is an important aspect of 
developing commercial systems for drug production. This chapter highlights advances made by 
Kosan Biosciences and some other groups in this area of natural-products drug research. 

Key Words: Antibiotics; anticancer drugs; biosynthesis genes; drugs; erythromycin; 
epothilone; FK520; geldanamycin; polyketide; polyketide synthase; rapamycin; secondary natural 
products. 

1. Introduction 

1. 1. Approach 

For more than six decades, the chemical diversity of microbial metabolites has made 
them a rich source of natural-product drug leads and therapeutically important drugs. 
Notwithstanding the fact that compounds with new structures continue to be uncovered 
regularly (1,2), there is a pervasive belief that natural products are a waning source of 
new drugs. Certainly, there has been a precipitous decline in new chemical entities that 
have been registered with the US Food and Drug Administration (FDA) over the past 
5 yr, including ones from natural sources. On the other hand, Craig et al. (2) have noted 
that for the areas of cancer and infectious diseases, 60% to 75% of the drugs used to 
treat these diseases are of natural origin, based on the numbers of new drugs approved 
during the period 1983-2002. Part of the explanation for the decline lies in the de-emphasis 
of natural-product discovery programs by large pharmaceutical companies over the 
past 10 yr. However, it is likely that inherent limitations in the ability of traditional 
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screening methods to access the full range of chemical diversity in nature has also been 
a contributing factor. 

One approach to overcome this dearth of naturally derived drug leads involves the 
identification and use of the genes governing the biosynthesis of natural products to 
create new compounds as well as to improve the economics of their production. Re- 
search into the genetics of microbial secondary metabolite production since 1980 has 
provided a wealth of knowledge about its genetic and biochemical basis (3,4). Such 
information, together with the widely held belief that the enzymes of secondary meta- 
bolic pathways are less fastidious overall than their primary metabolism counterparts, 
has stimulated the growth of a research area called "combinatorial biosynthesis." It 
encompasses studies of the properties of nonnatural combinations of biosynthesis genes 
with the aim of exploiting the enzymes' sloppiness for the production of new natural 
products (5,6). Because this research is largely being done with bacteria, it falls within 
the larger field of the metabolic engineering of culturable microorganisms. 

1.2. Polyketides and Polyketide Synthases 

Microbes have the capacity to produce a multitude of structurally complex chemi- 
cals by adapting processes of primary metabolism for self-defense and cell signaling 
purposes. For instance, long-chain fatty acids are made to provide components of cell 
membranes and to store energy generating potential as triacylglycerides, whereas en- 
zymes called polyketide synthases (PKSs) evolved the capability of making a vast num- 
ber of compounds known as polyketides from the same substrates used by fatty-acid 
synthases, and largely by the same type of biochemistry (7). Examples of two types of 
PKSs and their associated polyketides are illustrated in Figs. 1 and 2. Type II PKSs 
(Fig. 1) consist of a collection of largely monofunctional proteins that catalyze the 
formation of typically polycyclic aromatic compounds from acetate and malonate only. 
Type I PKSs (Fig. 2), in contrast, use large multifunctional proteins to make 
polyoxygenated, aliphatic compounds from several different kinds of acyl-coenzyme 
A substrates. 

The so-called modular PKSs (8-10) are featured in this chapter because these en- 
zymes have enabled the most fruitful genetic engineering route to structural variants 
(analogs) (11,12) of polyketides that are important therapeutic drugs, like the antibac- 
terial erythromycin A (Fig. 2) or experimental agents like 17-allylamino-17-demethoxy 
geldanamycin (17-AAG), which currently is undergoing clinical trials as an antitumor 
drug (discussed later). Three other types of PKSs are not discussed here; for reviews, 
see refs. 7,13. 

A modular PKS consists of large, multifunction proteins, each with different combi- 
nations of domains, called modules, that function like the constituent biochemical ac- 
tivities of fatty-acid synthases. For 6-deoxyerythronolide B synthase (DEBS), the PKS 
that forms the backbone of the erythromycins and is encoded by the three eryAI-111 
genes (8,14), 6-deoxyerythronolide B (6dEB) is produced by the successive condensa- 
tion of one propionyl-coenzyme A (CoA) and six 2-methylmalonyl-CoA molecules 
(Fig. 2). The three subunits of DEBS have two modules, each of which contains the 
activities needed for one cycle of polyketide chain elongation, as illustrated by the 
structures of the six enzyme-bound intermediates in Fig. 2. Every module contains a 
ketosynthase (KS), an acyltransferase (AT), and an acyl carrier protein (ACP) domain, 
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Fig. 1. An illustration of the mechanism of the type II polyketide synthase (PKS) involved in 
the biosynthesis of tetracenomycin Fl and C (68). The PKS consists of individual protein sub- 
units that act in concert to assemble the acetate starter unit (produced by decarboxylation of 
enzyme-bound malonate) and 9 chain extender units into an TcmM-bound decaketide by an 
iterative process involving a malonyl-CoA:ACP acyltransferase and the proteins TcmJ, TcmK, 
TcmL and TcmM. The latter is cyclized to Tcm F2 by the TcmN enzyme, with assistance by 
TcmJ, then Tcm F2 is cyclized once more by Tcml to form Tcm Fl. The latter intermediate is 
converted to tetracenomycin C by tailoring enzymes. 



which together catalyze a two-carbon extension of the chain between a starter unit 
(e.g., propionyl-CoA or some enzyme-bound acylthioester intermediate) and a chain 
extender unit (e.g., 2-methylmalonyl CoA). The AT domains are specific for 2- 
methylmalonyl CoA except for the AT in the didomain loading module, which prefers 
propionyl CoA. After each two-carbon unit condensation, the oxidation state of the (3- 
carbon is either retained as a ketone (e.g., module 3), or modified to a hydroxyl, 
methenyl, or methylene group by the presence of a ketoreductase (KR) (e.g., module 
2), a KR + a dehydrase (DH), or a KR + DH + an enoyl reductase (ER) (e.g., module 4), 
respectively. Polyketide assembly is often terminated by a thioesterase (TE) domain at 
the C-terminus of the final module (module 6) to release the product from the protein. 
For DEBS, release is coincident with the intramolecular cyclization that results in the 
formation of 6dEB, a typical polyketide macrolactone product. Synthesis of 6dEB is 
followed by C6-hydroxylation {eryF) to yield erythronolide B. Addition of the sugar 
L-mycarose (via thymidine diphospho(TDP)-mycarose) yields 3-O-oc-mycarosy- 
lerythronolide B, and the addition of desosamine (via TDP-desosamine) yields eryth- 
romycin D. The two sugars are produced by independent pathways (not shown) but 
controlled by the genes designated eryB (mycarose) and eryC (desosamine). The final 
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Fig. 2. An illustration of the mechanism of the type I modular polyketide synthase involved in the biosynthesis of 6dEB. 
Each of the 6-deoxyerythronolide B synthase (DEBS) subunits is represented by a broad, open arrow containing the relevant 
domains in each module. Key enzyme-bound intermediates of carbon chain assembly are shown bound to the acyl carrier 
protein domains. Assembly begins at the loading didomain of the first DEBS subunit, upon attachment of propionate which 
then reacts with one equivalent of bound 2-methylmalonate, obtained from its CoA ester. Further equivalents of 
2-methylmalonyl-CoA are used by DEBS to produce 6dEB, as explained in the text. 6dEB is then converted to the erythro- 
mycin A-D glycosides by tailoring enzymes. 
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steps are hydroxylation of erythromycin D to yield erythromycin C by a second P450 
enzyme (eryK) and O-methylation of the mycarosyl residue (eryG) to yield the 
cladinosyl moiety in erythromycin A and B. 

In effect, the AT specificity and the types of catalytic domains within a module 
serve as codes for the structure of each two-carbon unit; the order of the modules in a 
PKS specifies the sequence of the distinct two-carbon units, and the number of mod- 
ules determines the length of the polyketide chain. Variations in the acyl-CoA sub- 
strates used by a modular PKS, the number of domains within a module and the number 
of modules in the PKS are responsible for establishing the first set of structural charac- 
teristics of the polyketide; differences in the kinds of biochemical transformations the 
compound produced by the PKS undergoes are dictated by the products of the "tailor- 
ing enzymes" (eryB, eryC, eryF, eryG, and eryK for the erythromycins) and establish 
the final structure. Consequently, engineering a microorganism to produce novel 
polyketides can involve altering only the PKS genes or both them and the tailoring 
genes. These ideas are illustrated below. 

1.3. Research and Development Strategy at Kosan Biosciences 

To expedite drug discovery and the early stage development process, Kosan has 
chosen to identify microbial polyketides known to have some type of biological activ- 
ity and therapeutic potential, then use genetic engineering to alter the compound's struc- 
ture in ways that could improve its utility (e.g., increasing potency or correcting some 
remediable limitation in its pharmacology, pharmacokinetics, or pharmacodynamics) 
or simply the economics of its large-scale production. As far as possible, the changes 
sought by gene manipulation are based on knowledge of the structure-activity relation- 
ships and molecular modeling of drug:target interactions. We do not make libraries of 
polyketides to screen for activity against some biological target as the route to identifi- 
cation of lead molecules. This could become feasible, however, if suitable advances in 
PKS engineering are made through the approaches currently being explored, as ex- 
plained in Subheading 5.2. 

1.4. Other Approaches to Discovering New Drugs From Natural 
Products 

The time-honored method of screening extracts of culturable microorganisms for 
biological activities of interest, followed by purification of the responsible metabolites 
to identify lead compounds, when necessary supported by an ensuing analog synthesis 
program, has fallen out of favor in large pharmaceutical companies. Nonetheless, sev- 
eral chapters are devoted to this topic in this volume. Genetics-based approaches, such 
as (1) scanning the genome of micoorganisms by high-throughput DNA sequencing 
for novel sets of secondary metabolism genes that can be caused to be expressed 
through manipulation of growth conditions or (2) constructing gene libraries from the 
DNA of uncultured microorganisms and expressing the clusters of secondary metabo- 
lism genes identified by DNA sequencing in heterologous hosts, are gaining favor and 
thus are covered elsewhere in this book. It remains to be seen whether either approach 
can uncover drug leads at a pace that will sustain their pursuit commercially. 
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2. The Macrolide Antibiotic Paradigm 
2. 1. Erythromycins and Their Analogs 

Structure-function studies of DEBS carried out over the past 13 years has created a 
paradigm for modular PKSs that has guided drug discovery research based on the engi- 
neering of any modular PKS. The pioneering work of Katz and co-workers on the eryA 
genes (8) and the initial discovery that DEBS consists of multifunctional proteins (14) 
pointed the way to making numerous 6dEB analogs through domain mutation or re- 
placement experiments (15). In some cases, the 6dEB analogs were converted into 
erythromycin analogs by conducting the work in the erythromycin producing organism 
Saccharopolyspora erythraea, or by using DEBS-null mutants of this organism to 
bioconvert the 6dEB analogs initially made in a heterologous host organism such as 
Streptomyces coelicolor (16). Analogs of 6dEB and the erythromycins have been pro- 
duced that originate from nonnatural starter units (17,18), which lack one or more 
branched methyl groups (15,19-21), have one methyl group replaced with an ethyl 
group (22), have different oxidation states at certain hydroxy-bearing positions (23,24), 
are hydroxylated at different positions (20,21), or have double bonds in place of nor- 
mally saturated positions (25). Compound libraries containing approximately 50 6dEB 
analogs have been produced also (26). A subset of these compounds was then con- 
verted to their 5-O-desosaminyl glycosides (27). Novel erythromycin-like compounds 
have also been made by adding a module to DEBS (15,28). Representative examples of 
such analogs are shown in Fig. 3. 

There are underlying assumptions in the above work: (1) any structural change ef- 
fected by engineering the PKS domain governing some step in the assembly process 
will not affect the subsequent steps in the process severely enough to obviate formation 
of the analog of the normal PKS product; and (2) any structural change effected by 
engineering or inactivating a specific tailoring enzyme gene will also not obviate for- 
mation of the analog of the complete polyketide. The second assumption has been 
tested to some extent for erythromycin (29) and other polyketides (30), and the results 
of the work summarized above have established certain boundaries on what can be 
expected for the genetic engineering of modular PKSs in general. 

The success of such experiments has often been viewed by the relative amount of 
product produced by the engineered vs wild-type microorganisms under comparable 
fermentation conditions. This can vary from 0.1% to nearly 90% of the normal amount, 
with lower yields seen more often than ones close to that of the wild-type system. 
Changes in the functionality present in different positions of 6dEB have been achieved 
with a wide range of success (i.e., yield), and it has not been possible to explain the 
failures observed or predict with confidence which additional changes could be 
achieved. Nevertheless, the successful production of the erythromycin analogs achieved 
to date clearly validates the overall approach that was built on the first successful ex- 
periments reported by Donadio et al. in 1991 (8). Furthermore, attempts are being made 
through carefully designed and executed in vitro experiments with the individual mod- 
ules of DEBS and its intact subunits to define the parameters that govern the relative 
activity of engineered modular PKSs; e.g., see ref. 31). 
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Fig. 3. Examples of six different erythromycins produced by engineered 6-deoxyerythronolide B synthase genes. R = H or OH. The 
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Fig. 4. The structures of the immunosuppressants, rapamycin, FK506 and FK520. The 
methoxy groups of FK520 at positions 13 and/or 15 have been replaced with hydrogen, methyl 
or ethyl groups as discussed in the text. 



2.2. Engineering Other Macrolactone PKSs 

2.2.1. FK520 

Three macrolactone antibiotics FK506, FK520, and rapamycin (Fig. 4), different 
from erythromycin and other antibacterial macrolides, are used therapeutically in or- 
gan transplantation for their immunosuppressive property. Five analogs of FK520, rep- 
resenting substitutions of the methoxy groups positions 13 and 15 with ethyl, hydrogen, 
or methyl groups, have been produced by appropriate AT domain exchange experi- 
ments directed at the relevant PKS genes (32). The 13-hydrogen analog of FK520 re- 
tained the in vitro properties typical of the immunosuppressive FK506 and FK520, 
whereas the 13-methyl analog displayed greatly reduced suppression of T-cell activa- 
tion and, along with its 18-hydroxy derivative (prepared chemically), has apparent 
nerve growth stimulatory activity (32). Five different rapamycin analogs have been 
produced by the same approach and are under evaluation at Kosan Biosciences (S. 
Bondi, J. Kennedy, L. Tang, S. Ward, and C. R. Hutchinson, unpublished work). 

2.2.2. Epothilones 

Epothilones B and D are polyketide macrolactones from subspecies of the myxo- 
bacterium Sorangium cellulosum (33), which are undergoing clinical trials as antican- 
cer drugs (34). They inhibit cytoskeletal formation by increasing the stability of 
microtubules during their formation, which interferes with formation of the mitotic 
spindle and results in potent arrest of cancerous as well as normal cell growth at the 
G2/M transition of cell development (35). Because epothilone D seems to have a better 
therapeutic index than epothilone B, at least in animals (36), this compound was 
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Fig. 5. The structures of epothilone B and D, and novel analogs of epothilone D that have 
been produced by engineered epothilone polyketide synthase genes. 



selected for development at Kosan Biosciences and is in early Phase II trials as KOS- 
862. A commercial process for its large-scale production has been established that re- 
lies on expression of the cloned epothilone PKS genes (37,38) in Myxococcus xanthus 
(39). The convenience of performing gene engineering experiments in this organism 
has facilitated making several analogs of epothilone D (40); the formation of two of 
these, 10,ll-didehydro-12,13-dihydro-13-hydroxyepothiione D (40) and 10,11- 
didehydro-epothilone D (41) (Fig. 5), by inactivation of the DH and ER domains, 
respectively, in module 5 of the epothilone PKS, has shed light on the mechanism of 
formation of the unusual c/s-12,13 C=C (40). 

3. Geldanamycin Analogs by Gene Engineering 

Geldanamycin and the closely related herbimycin A (Fig. 6) were initially discov- 
ered by virtue of their weak antibacterial and antifungal (42) or plant-growth inhibitory 
(43) properties. Interest in these benzoquinone ansamycins increased greatly upon the 
discovery of the antitumor properties of geldanamycin (44) and herbimycin A (45). 
Neckers and co-workers (46) showed in 1994 that their principal cellular target is 
Hsp90, a ubiquitous and abundant protein chaperone of mammalian cells (47). 
Geldanamycin competes with ATP for the ATP-binding site of Hsp90 and, when bound, 
inhibits the ATP-dependent functions of this protein. A particular function is the abil- 
ity of Hsp90 to chaperone nascent protein kinases that are critical components of signal 
transduction pathways, especially those in certain cancer cells (47). In the presence of 
geldanamycin or herbimycin A, the immature kinases undergo rapid degradation, as a 
consequence of ubiquitination and subsequent catabolism by the proteosome, and the 
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Fig. 6. The structures of geldanamycin, herbimycin A and the 17-substituted geldanamycins 
17-AAG and 17-DMAG discussed in the text. 



levels of the mature kinases become depleted. This can result in a cytostatic effect on a 
cancer cell or in some cases apoptosis and cell death. 

The discovery that Hsp90 and one or more of its protein kinase cohorts are overpro- 
duced in several types of human cancers has led to considerable interest in geldanamycin 
and its analogs as potential anticancer drugs (48,49). Many geldanamycin analogs have 
been produced by replacement of the C17 0-methoxy group with substituted amines 
(50). 17-AAG (Fig. 6), is currently undergoing phase I and early-stage phase II clinical 
trials (49,51); and a more water-soluble analog, 17-dimethylaminoethylamino-17- 
demethoxygeldanamycin (17-DMAG) (Fig. 6), has completed preclinical develop- 
ment (52). 

Additional geldanamycin analogs are being sought in an attempt to ameliorate the 
hepatotoxicity that is characteristic of geldanamycin, although this is lessened by the 
17-amino substitutions noted above (53,54). One approach is to make demethyl and 
demethoxy analogs by PKS gene engineering, which might show increased potency, or 
nonquinone analogs by inactivation of the tailoring genes, which might have an improved 
therapeutic index, because the quinone moiety is believed to facilitate formation of free 
radicals in vivo (53,54). The geldanamycin biosynthesis genes were cloned and char- 
acterized from Streptomyces hygroscopicus NRRL 3602 (55) to enable genetic engi- 
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neering experiments; then recombinant strains carrying the gdmA PKS genes with AT 
domain replacements in modules 4, 5, or 7 of the gdmA2 and gdmA3 PKS genes (55) 
were created that produced novel geldanamycin analogs like the ones shown in Fig. 7 
(Bianka Hadatsch, Z. Hu, R. McDaniel, K. Patel, M. Piagentini, A. Rascher, L. Vetcher, 
and C. R. Hutchinson, unpublished work). Inactivation of the gdmM monooxygenase 
gene led to formation of the phenol analog shown; the similar compound formed by the 
module 7 AT mutant must reflect its inability to serve as a substrate for the GdmM 
enzyme (A. Rascher and C. R. Hutchinson, unpublished work). Such compounds can 
be investigated as leads for drug development. 

4. Polyketide Genes From the Metagenome of Marine Organisms 
4. 1. Marine Natural Products As Drug Leads 

Cytotoxic, antibacterial, antifungal, anti-inflammatory compounds and many other 
types of natural products have been isolated from different kinds of marine organisms 
(see ref. 56 and references therein). However, the supply of such potentially valuable 
drug leads has customarily been hampered by an inadequate source, as a result of the 
limited availability or difficult accessibility of the producing organism, which usually 
is very difficult to grow even by aquaculture. It is widely believed that microbial sym- 
bionts make marine-derived compounds like the polyketides that are commonly found 
in terrestrial microorganisms (57,58). Consequently, it should be possible either to iso- 
late pure cultures of these microbes to enable production of the compounds in large 
amount or to clone the production genes from the metagenome of the marine organism, 
if a producing microbe cannot be isolated. Once they are cloned, it is feasible to express 
the genes in different microbial hosts as the means to determine what is produced and 
whether it is the desired drug (58). Limitations to achieving this goal could be inappro- 
priate expression devices, lack of a unique substrate for a pathway enzyme, or auto- 
toxicity of the produced compound to the host. 

Kosan is interested in pursuing the development of antitumor drugs from marine 
sponges and, as a consequence, has investigated the metagenome of Discodermia 
dissoluta (R. Gadkari, C. Reeves, A. Schirmer, and C. R. Hutchinson, unpublished 
work). Libraries containing many thousands of clones were made in fosmid and bacte- 
rial artificial chromosome (BAC) vectors, using total DNA from the sponge and from 
fractions enriched for unicellular or filamentous microorganisms, then screened for the 
presence of the KS domains of modular PKS genes by polymerase chain reaction (PCR) 
analysis and by direct hybridization methods using suitable gene probes. Typical results 
are summarized in Table 1. 

These data firmly establish that (1) a diverse range of KS homologs can be cloned 
and characterized in total DNA from the sponge samples or from bacterially enriched 
cell fractions, (2) bacterial genes can make up the majority of clonable genes from the 
metagenome of the sponge, and (3) in the case studied, multimodular PKS genes can be 
comparatively rare whereas PKS-NRPS genes can be quite common in the filamentous 
bacterial cells. It is unclear why the sponge genes, i.e., eukaryotic DNA that is pre- 
sumed to come from either sponge cells or eukaryotic symbionts like diatoms and 
dinoflagellates, made up such a small percentage of the types of genes clonable from 
the total sponge DNA preparations. A possible explanation is related to the fact that the 
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Fig. 7. Structure of the geldanamycin analogs made by engineered gdmA2A3 polyketide 
synthase genes or by the strain with a disrupted gdmM gene. The positions converted to 
demethyl or demethoxy positions by acyltransferase swaps in the three different domains of 
gdmA2 or gdmA3 are encircled. 



results of cytological staining studies show that the D. dissoluta tissue contains at least 
100 times more microbial cells than sponge cells; if for some reason the bacterial DNA 
is more readily clonable than sponge-cell DNA, then the preponderance of the former 
type of genes is understandable. 

5. Future Directions for Engineering Drug Production by Bacteria 

5.1. Escherchia coli As the Host for Large-Scale Production 

New types of microbial hosts are being studied as alternatives to the widely used 
filamentous actinomycetes bacteria for the commercial production of polyketides and 
other types of natural products. An example of the use of a common laboratory strain of 
a myxobacterium to produce epothilone D is presented above. Escherichia coli is an- 
other possibility, but, unlike the myxobacteria and actinomycetes, it does not contain 
all the substrates commonly used by a modular PKS, or some of the necessary cofac- 
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Table 1 

Distribution of Modular Polyketide Synthase (PKS) Gene Cluster 

in Metagenomic Libraries of Discodermia dissoluta 





Unicellular 


Filamentous 






bacterial 


bacterial 




Library made from: 


enrichment 


enrichment 


Total sponge 


Fosmids screened 


64,000 


36,000 


55,000 


PKS positive fosmids" 


465 (0.7%) 


185 (0.5%) 


375 (0.7%) 


Fosmids with bacterial inserts 


>90% 


>60% 


>80% 


PKS fosmids endsequenced 


145 


148 


168 


Fosmids with 2 PKS ends fc 


2% 


1% 


2% 


Fosmids with 1 PKS end f 


29% 


47% 


41% 


Fosmids with no PKS end rf 


68% 


52% 


57% 


PKS fosmids with an NRPS e end 





12% 


1% 



" By hybridization with sponge-derived KS probe pools at low stringency 

b Contains part of large PKS gene cluster (>5 modules) 

c Contains start or end of PKS gene cluster of unpredictable size 

d Contains small PKS gene cluster (1-5 modules) 

e nonribosomal peptide synthetase 



tors. Nevertheless, attempts are underway to introduce these features into typical K12- 
derived E. coli strains and to adapt common expression tools to handle the 5- to 40-kb 
modular PKS genes. To date, E. coli strains have been engineered to contain the genes 
that convert fed propionate to its CoA form and turn propionyl CoA into 2-methyl- 
malonyl CoA without catabolism of propionate to methylcitrate (59,60), which enable 
formation of 2-methylmalonyl CoA from fed glucose via the tricarboxylic acid cycle 
(but only if a precursor of vitamin B12 is added to the fermentation) (61), and which 
convert the apo form of the modular PKSs into their pantotheinylated, enzymatically 
active forms (59). An investigation of the variables in expression of the substrate sup- 
ply, modification, and PKS genes that affect the amount of 6dEB produced by such 
strains when grown in shake flasks (62) has set the stage for optimization of a high cell- 
density, fed-batch fermentation-based process that has already achieved 1.1 g/L titers 
of 6dEB (63). Further research is likely to make E. coli as attractive as M. xanthus or 
even typical actinomycetes for production of relatively simple polyketides — i.e., ones 
like epothilone D for which the PKS product does not undergo extensive tailoring reac- 
tions that would require introduction of multigene clusters for complex metabolic path- 
ways. 

5.2. Creation of Novel PKS Genes 

An exciting possibility is on the horizon — that novel polyketides could be manufac- 
tured by unusual combinations of the basic building block of modular PKSs: a module 
that contains all the information for one round of carbon chain extension for a given 
substrate. Research into the factors governing the choice of chain extension substrate 
and the ability of the incoming, PKS-tethered substrate to react with this 2-carboxy- 
acylthioester, once it is loaded onto the ACP domain of a module, is beginning to 
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establish some boundary conditions; see ref. 64 and references therein. Earlier studies 
had shown that the productive interaction of two separate modules, when presented 
with typical acylthioester substrates, was greatly influenced by unique, short amino 
acid sequences, called "linkers," found at both ends of the protein and even between 
modules within multimodular proteins (65-67). Consequently, it is logical to assume 
that collections of individual modules containing appropriate linkers could be screened 
in bimodular combinations for their ability to convert a given acylthioester starter unit 
and 2-carboxyacylthioester chain extension substrate to some product. Productive 
bimodular combinations (defined by kinetic parameters and/or relative yields as a func- 
tion of particular starter and extender substrates) could then be challenged with a col- 
lection of additional modules, and so forth to define the ones that made certain desired 
products or just any one of a set of theoretically possible products. The groundwork 
to expedite such research has been laid at Kosan Biosciences; thus, the time is ripe to 
exploit how multimodular PKS genes themselves must have evolved, in the attempt to 
expand the repertoire of therapeutically useful natural products nature has provided. 
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Improving Drug Discovery From Microorganisms 
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Summary 

Microorganisms remain unrivalled in their ability to produce bioactive small molecules for 
drug development. However, the core technologies used to discover microbial natural products have 
not evolved significantly over the past several decades, resulting in a shortage of new drug leads. 
Advances in DNA-sequencing and bioinformatics technologies now make it possible to rapidly 
identify the clusters of genes that encode bioactive compounds and to make computer predictions of 
chemical structure based on gene sequence information. These structure predictions can be used to 
identify new chemical entities and provide important physicochemical "handles" that guide com- 
pound purification and structure confirmation. Industrialization of this process provides a model for 
improving the efficiency of natural-product discovery. The application of advanced genomics and 
bioinformatics technologies is now poised to revolutionize natural-product discovery and lead a 
renaissance of interest in microorganisms as a source of bioactive compounds for drug development. 

Key Words: Natural products; genomics; drug discovery; bioinformatics; actinomycetes; 
dereplication; fermentation; structure elucidation. 

1. Introduction 

Microorganisms produce some of the most important medicines ever developed. 
They are the source of lifesaving treatments for bacterial and fungal infections (e.g., 
penicillin, erythromycin, streptomycin, tetracycline, vancomycin, amphotericin), can- 
cer (e.g., daunorubicin, doxorubicin, mitomycin, bleomycin), transplant rejection (e.g., 
cyclosporin, FK-506, rapamycin), and high cholesterol (e.g., statins such as lovastatin 
and mevastatin) (Fig. 1). Microbial natural products are notable not only for their po- 
tent therapeutic activities, but also for the fact that they frequently possess the desirable 
pharmacokinetic properties required for clinical development. The drugs shown in Fig. 1 
are just a few of the many microbial natural products that reached the market without 
any chemical modifications required, a testimony to the remarkable ability of microor- 
ganisms to produce drug-like small molecules. Indeed, the potential to hit a "home 
run" with a single discovery distinguishes natural products from all other sources of 
chemical diversity and fuels the ongoing efforts to discover new compounds. 

Traditionally, the search for new natural products has started by growing microor- 
ganisms in the laboratory and testing the fermentation broths for bioactivity. However, 
we now know that microorganisms have many natural-product gene clusters that they 
do not readily express when grown in the laboratory (1-3). So despite decades of fer- 
mentation-broth screening, it is likely that a vast supply of bioactive microbial cora- 
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Fig. 1. A few of the landmark medicines produced by microorganisms. 



pounds remains to be discovered. This untapped potential provided the impetus for us 
to develop an automated genomics platform that predicts the chemical structures of 
natural products by reading the sequences of the gene clusters that direct their synthe- 
sis. By surveying the genome, we can identify all of the natural products that a micro- 
organism can make before fermentation studies begin, and specifically tailor the 
downstream production and purification strategies to isolate likely new chemical enti- 
ties (NCEs) and avoid the re-isolation of known compounds. The integration of new 
genomics technologies greatly increases the efficiency of discovery and makes it pos- 
sible to build a robust pipeline of NCEs from a small collection of microorganisms, 
providing a new paradigm for natural-product discovery. Here, we describe the core 
technologies behind the discovery platform developed at Ecopia BioSciences Inc. and 
present the discovery of a new antifungal agent as a case study to illustrate the power of 
the genomics-guided approach. 
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2. Genomics-Guided Natural-Product Discovery 

2. 1. Genome Scanning Technology 

The development of a discovery platform that can predict chemical structures from 
gene sequences required the development of several new technologies and resources, 
including a means to efficiently isolate and sequence natural-product gene clusters from 
microbial genomes; a large reference database of gene clusters linked to the structures 
of the compounds they encode; and specialized computer applications that can detect 
correlations between gene sequence and chemical substructure elements. We devel- 
oped a high-throughput genome-scanning method to sequence natural-product gene 
clusters without sequencing entire genomes (3). Our strategy was to scan the genomes 
of selected microorganisms that were reported to produce known, structurally diverse 
natural products and to build a database of gene clusters covering the full range of 
natural-product chemical diversity (the Ecopia Decipher® database). The approach 
proved to be successful, as in all cases the gene clusters corresponding to known natu- 
ral products were identified by deductive analysis, providing an important training set 
for chemical structure predictions. The most striking finding, however, was the large 
number of unexpected gene clusters found in these previously well-studied microor- 
ganisms. Genome scanning of approximately 60 actinomycete strains revealed some 
700 natural product gene clusters, or an average of a dozen gene clusters per organism. 
This number exceeds by at least a factor of ten the number of natural products that 
would have been detected from these organisms by traditional screening approaches 
(Fig. 2). It is now clear that many gene clusters are expressed only under certain growth 
conditions. Furthermore, even when they are expressed, some gene clusters produce 
compounds only at very low levels, below the limit of detection of conventional screen- 
ing methods. This may explain why so many natural products have eluded detection in 
the past, as it was common practice to screen only one to three growth conditions for 
each strain. 

2.2. Genomics-Guided Discovery Platform 

To capitalize on the wealth of gene clusters revealed by genome scanning, we devel- 
oped a genomics-guided discovery platform designed to rapidly identify clusters en- 
coding likely NCEs and target the compounds for purification (Fig. 3). A suite of 
specialized software and computer applications predicts the structures of compounds 
encoded by new gene clusters via automated comparisons with known clusters in the 
database. These structure predictions identify possible NCEs and provide important 
physicochemical "handles" (including molecular weight, ultraviolet [UV] absorbance, 
lipophilicity, and other properties) that are then used to detect the desired compound in 
fermentation broths. To fully exploit their potential, each microorganism is grown in as 
many as 50 different fermentation media in order to maximize the probability that each 
of its gene clusters will be expressed. A number of custom-made analytical tools are 
used to simultaneously display and analyze mass spectroscopy, UV, and bioactivity 
data generated from extracts prepared from all the growth conditions used. These tools 
make it possible to detect compounds that may be produced only rarely in fermentation 
broths, or at very low levels, and to identify those compounds whose properties match 
the gene-cluster predictions. The structure information provides practical handles to 
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guide the purification of targeted compounds and greatly facilitates the final structure 
elucidation by spectroscopic methods. 

The tremendous power of genomics-guided discovery is driven by a database and 
computational platform that "learns" from previous results and improves with each 
new compound discovered. In the final step of the discovery cycle, the confirmed struc- 
ture of a new molecule is linked in the Decipher® database to the gene cluster that 
encodes it, thus enhancing the ability of the system to make future correlations between 
genes and chemical structures. Even the "rediscovery" of known compounds adds valu- 
able new information to the database, as the genetic blueprint for each structure identi- 
fies new genes-to-molecules correlations. In addition, all of the chemical and biological 
data generated during the fermentation, extraction, purification, and bioactivity screen- 
ing stages are fed back into the database and integrated with the genomics information. 
Sophisticated bioinformatics applications are then able to identify relationships between 
the diverse data sets that can be used to guide the production and purification of targeted 
metabolites — for example, by defining the fermentation media that are likely to sup- 
port the expression of a gene cluster and by identifying purification schemes that have 
proved successful with isolations from the specific medium and for similar structure 
types. 

3. Genomics-Guided Discovery: A Case Study 

3.1. Genome Scanning of Streptomyces aizunensis 

The untapped potential of microorganisms to produce bioactive NCEs is illustrated 
by our experience with the actinomycete Streptomyces aizunensis, a producer of the 
antibiotic bicyclomycin. The bicyclomycin gene cluster was targeted for isolation 
because the compound contains some unusual functional groups and the genes required 
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Fig. 3. The genomics-guided discovery platform developed at Ecopia, a new paradigm for 
natural product discovery. The platform identifies all of the natural-product gene clusters in a 
genome, predicts the chemical structures of the compounds they encode, and targets compounds 
that are likely to be NCEs for production and purification. 



to make these kinds of structures were not known. Surprisingly, genome scanning of 
the S. aizunensis genome identified 1 1 natural-product gene clusters in addition to the 
bicyclomycin cluster, even though this organism produced only bicyclomycin in pub- 
lished fermentation screening studies (4,5). One of the additional gene clusters was 
predicted to encode a compound similar to the known antibiotic streptothricin, based 
on computer-based comparisons to other clusters in the database. This information 
proved to be valuable, as a streptothricin-like compound was indeed detected during 
subsequent fermentation experiments. Knowing in advance that this compound would 
be produced made it very easy to identify and circumvent, while purifying other com- 
pounds from the fermentation broths. More importantly, the structure predictions gen- 
erated for the remaining 10 gene clusters did not match any compounds present in 
databases of known natural products, indicating that they are likely to encode NCEs. 
Each of these clusters presented an exciting opportunity for new compound discovery. 
In the following sections, we demonstrate how automated gene sequence analysis was 
used to predict the structure of an NCE encoded by one of these gene clusters and how 
genomics information was used to guide the purification of the compound. 

3.2. Automated Analysis of Gene Clusters and Chemical 
Structure Prediction 

All gene clusters identified by genome scanning enter a fully automated analysis 
cascade of specialized software applications that identify open reading frames (ORFs), 
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assign functions to each gene in the cluster, and predict chemical substructure ele- 
ments. The output of the automated analysis is displayed in an interactive graphical 
user interface designed to allow scientists to quickly assess the accuracy of the com- 
puter predictions and to determine whether a cluster is likely to encode a known com- 
pound or an NCE. The automated analysis of one of the S. aizunensis gene clusters, 
designated 023D, is shown in Fig. 4. In this cluster, the computer assigned 35 ORFs to 
protein families based on homology comparisons to proteins in the Decipher database. 
The disposition of these ORFs in the cluster is shown in window A of Fig. 4, where 
each ORF carries a four-letter code indicating the protein family to which it was 
assigned. Nine of the ORFs in the 023D cluster were designated as polyketide 
synthases (PKSs). PKSs and other multimodular protein families (such as nonribosomal 
peptide synthetases) are further processed by an automated software application that 
parses the proteins into individual enzymatic domains. Each domain sequence is then 
compared to a series of protein models of active domains to identify domains that are 
likely to be nonfunctional. Additional computer scripts are also invoked when particu- 
lar domains are encountered. For example, the substrate specificity of each acyltransferase 
(AT) domain is readily assigned by a phylogenetic comparison to AT domains of known 
specificity, while a similar analysis of thioesterase (TE) domains very effectively dis- 
tinguishes domains that generate linear polyketide products from those that catalyze 
the formation of cyclic products. The result is an automated "domain string" (displayed 
in Fig. 4, window C) that captures the structure of the polyketide backbone in a line 
notation that can then be translated into a chemical structure prediction (Fig. 4, win- 
dow B). The automated analysis of the 023D PKS system predicted a long, linear 
polyketide chain bearing polyene chromophores. While many cyclic (macrolide) poly- 
ene natural products are known, linear polyenes remain relatively rare. Chemical sub- 
structure searches using the predicted polyketide backbone identified no similar 
structures in natural-products databases, providing the first indication that the 023D 
gene cluster encoded a NCE. 

3.3. Correlating Genes With Chemical Substructures 

The "family string" generated by the analysis cascade (shown in window D of Fig. 
4) provides a representation of the cluster that is used in an automated search of the 
database to identify gene clusters with similar families. The structures of the com- 
pounds linked to these clusters are then compared to identify common structural ele- 
ments. For example, three gene clusters in the Decipher database contain the ADSN, 
AYTP, and CALB families found in the 023D cluster. Structure analysis of the corre- 
sponding compounds identified a single common structural element, a 2-amino-3- 
hydroxycyclopentenone (C 5 N) group in amide linkage to a polyketide carboxylate 
(Fig. 5, upper). Inspection of the computer-predicted function of each family sug- 
gested a plausible pathway for the biosynthesis of a C 5 N group from glycine and 5- 
aminolevulinic acid. Thus, the presence of these three genes in a cluster provides a 
marker for the presence of this functional group in a natural product. Similarly, com- 
puter analysis correlated four families in the 023D cluster with the presence of a four- 
carbon, amine-containing (C 4 N) polyketide starter unit (Fig. 5, middle) and five 
families with a 6-deoxyhexose sugar moiety (Fig. 5, lower). In both cases the predicted 
functions of the families suggested likely biosynthetic pathways and strongly supported 
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Fig. 4. Automated analysis of the 023D gene cluster: (A) overview of the automated gene 
finding and family calling; (B) predicted structure of the polyketide backbone; (C) automated 
domain string; (D) automated family string. 
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Fig. 5. Correlating genes and structures. An automated application identifies gene clusters hav- 
ing similar gene families (highlighted). The compounds produced by these gene clusters are then 
compared to identify common structural elements (boxed). This analysis identified three genes that 
correlate with the C 5 N structure element (upper), four genes that correlate with the C 4 N polyketide 
starter group (middle), and five genes that correlate with a 6-deoxyhexose group (lower). 
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the gene-structure correlations. The family- and domain-string notations demonstrate 
the utility of reducing gene-sequence information to a series of computable properties, 
or "genetic descriptors," that a computer can learn to associate with chemical-structure 
descriptors. As the number of sequenced gene clusters in the Decipher database climbs, 
the ability to predict chemical structure directly from gene sequence is increasingly 
refined. 

The results of the automated analysis cascade provided a very precise prediction of 
the structure of the compound encoded by the 023D gene cluster, as shown in Fig. 6. 
Tools for substructure searching are also integrated into the discovery platform and 
allow a scientist to quickly assess whether a predicted structure has already been re- 
ported, providing an early opportunity for in silico dereplication. For example, sub- 
structure searches of the AntiBase database (Wiley Publishers, 2003) of over 30,000 
microbial natural products revealed only 64 products that contain the C 5 N group. This 
example illustrates how even a small amount of structure information can greatly limit 
the number of structures that need to be considered as candidate products. More impor- 
tantly, the addition of a second structure element to the query returned no hits from the 
database, providing further evidence for the novelty of the compound encoded by the 
023D gene cluster (Fig. 6). 

3.4. Finding the Needle in a Haystack: Genomics-Guided Purification 

Having strong evidence for an NCE, the compound encoded by the 023D gene clus- 
ter was targeted for purification. The structure prediction immediately identified physi- 
cochemical properties or "handles" that could be used to guide the purification of the 
compound. For example, the compound was predicted to have a molecular mass in 
excess of 1290 Daltons (Da) and a distinctive UV spectrum imparted by the pentaene 
chromophore. To ensure that the gene cluster was expressed, S. aizunensis was grown 
in more than 50 different fermentation media in 25 -mL shake-flask cultures. Methanol 
extracts of each culture were subjected to high-performance liquid chromatography 
(HPLC)-UV-mass spectrometry (MS) analyses, and metabolites were monitored using 
a specially designed system that makes it possible to analyze HPLC fractions simulta- 
neously across all the different media conditions (Fig. 7, upper). An overview of the 
MS traces showed that the profile of metabolites varied considerably from medium to 
medium. The interface shown in Fig. 7 is fully interactive, so that the underlying chemi- 
cal data can be rapidly searched using queries that incorporate multiple physicochemi- 
cal parameters. For example, a mass filtering function allows the user to search all 
fractions for masses within a particular range. A search for metabolites having a mass 
greater that 1290 Da identified a single peak that appeared only in some fermentation 
media but not in others (Fig. 7, middle). Clicking on any fraction pops up a new win- 
dow that displays the full spectral data set for that fraction. When this was done for one 
of the peak fractions from the mass filtered search, the data revealed molecular ions 
consistent with a major isotope of mass 1296.7 Da and a UV absorption spectrum char- 
acteristic of a pentaene (Fig. 7, lower inset), fully consistent with the structure predic- 
tions generated by computer analysis. Thus, the chemical data provided strong evidence 
that the 1296. 7-Da metabolite corresponded to the compound encoded by the 023D 
gene cluster. In addition to the chemical data, aliquots of each HPLC fraction are rou- 
tinely tested for antimicrobial activity against a panel of bacterial and fungal patho- 
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gens, and the bioassay results can also be displayed along with the chemical data in the 
graphical interface. In Fig. 7 (middle), the screening data for Candida albicans is 
shown, as the fractions containing the 1296.7-Da compound exhibited a potent antifun- 
gal activity in the bioassay screens. 

Normally, the purification of a natural product from a complex fermentation mix- 
ture presents a formidable task that may take months to achieve. In this case, however, 
having the mass, UV, and bioactivity data in hand made the subsequent purification of 
the 1296.7-Da compound straightforward, and this was accomplished in a matter of 
days. Similarly, the elucidation of a complex natural-product structure using the stan- 
dard analytical techniques can be exceedingly difficult, but in this case it was greatly 
facilitated by the in silico structure prediction. The final structure was confirmed by 
multidimensional NMR spectrometry, and proved to be entirely consistent with the 
structure prediction generated by gene-sequence analysis (Fig. 8). The new compound, 
named ECO-02301, displayed potent in vitro activity against numerous fungal patho- 
gens, including Aspergillus fumigatus and azole-resistant strains of Candida albicans, 
and was shown to be efficacious in a mouse model of disseminated candidiasis, where 
treatment of infected mice resulted in a statistically significant increase of the median 
survival as compared to nontreated animals. ECO-02301 thus represents an exciting 
new chemical class of natural product and a promising agent for development as a 
treatment for serious fungal infections. 

4. Summary 

The discovery of ECO-02301 is one of the early successes of the genomics-guided 
platform developed at Ecopia and the new paradigm for natural-product discovery. The 
discovery of this compound from S. aizunensis provides direct evidence that the capac- 
ity for microorganisms to produce natural products has been greatly underestimated, 
and that exciting new natural products can be discovered from microorganisms that 
were already screened using traditional approaches. 
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Critical to the success of genomics-guided discovery are the database and data-min- 
ing tools that enable comparative analysis of gene clusters and the prediction of chemi- 
cal structures from gene-sequence information. With each new gene cluster added to 
the Decipher database, the structure-prediction capabilities of the platform become in- 
creasingly accurate. The integration of specialized applications to search for metabo- 
lites simultaneously across multiple growth conditions makes it possible to correlate 
the metabolome of an organism with the gene clusters discovered by genome scanning 
and to identify metabolites that are likely to be NCEs. 

Given that only a tiny fraction of the microbial world has been explored, the poten- 
tial for discovering new natural products is virtually unlimited. Increasingly, new com- 
pounds will be discovered at the computer, and bioinformatics technologies will be 
used to tailor strategies for their production and purification, thus overcoming many of 
the technical hurdles previously associated with natural-product discovery. With the 
introduction of powerful new genomics technologies, the remarkable fifty-year track 
record of microorganisms in producing landmark medicines is now poised to extend 
well into the new millennium. 



Fig. 7. (opposite page) Finding the predicted compound in fermentation broths. Upper, par- 
allel high-performance liquid chromatography (HPLC)-mass spectrometry analyses of extracts 
of S. aizunensis grown in multiple fermentation media. Each column represents a different 
fermentation medium, and each row in a column represents a single HPLC fraction. All fifty 
media can be analyzed simultaneously by scrolling across the window. Middle, the same inter- 
face, filtered for metabolites having a mass greater than 1290 Da, showing a single peak found 
in only some fermentation media; the Candida albicans screening data are also shown in this 
view (yellow highlights) as an antifungal activity correlated with the peak fractions. Clicking 
on a peak fraction pulls up a new window (lower inset) showing detailed spectroscopic data 
consistent with the predicted compound. 
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Pathway, and Genome Shuffling 
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Summary 

The development of an economically viable production processes is a significant hurdle in the 
commercialization of natural products. A primary method of achieving this goal is through strain 
engineering. Evolutionary engineering has been practiced for decades in the form of classic strain 
improvement. The process of genetic diversification and functional screening has now become a 
powerful means of improving the function of diverse biological systems from genes and enzymes to 
whole genomes. Gene shuffling is a method for effecting genetic diversification in evolutionary 
engineering programs that incorporates recombination into the evolutionary algorithm. This approach 
dramatically accelerates the process of directed evolution and is arguably the most robust method for 
the purposeful manipulation of biological structure and function. This chapter reviews the theory and 
practice of gene shuffling-mediated evolutionary engineering in the context of commercial strain 
improvement. Described are examples of the improvement of commercial natural-product fermenta- 
tion processes through the shuffling of individual genes, metabolic pathways, and whole genomes. 

Key Words: Fermentation; directed evolution; shuffling; natural products; Streptomyces; 
metabolic engineering. 

1. Introduction 

Despite the medical and commercial success of the class of compounds known as 
natural products, the investment in the discovery and the commercialization of new 
natural products has significantly decreased since its peak in the mid to late twentieth 
century. Explanations for this decline include the decrease in the discovery of new 
classes of natural compounds having interesting new activities, the advent of combina- 
torial chemistry, the desire of medicinal chemists to have rapid access to significant 
quantities of relatively pure compounds for chemical studies, and the time and expense 
of developing an economically viable process for the production of compounds for 
clinical and commercial use. Although many pharmaceutical companies have divested 
of their internal natural-products programs, smaller biotechnology companies are pick- 
ing up this effort and breathing new life into the field of natural-product discovery. 
New technologies include methods to identify and express cryptic secondary metabolic 
pathways (1), isolate the biosynthetic potential of "unculturable" organisms (2-4), the 
prospecting of marine microorganisms (5-7), and the various combinatorial approaches 
to exploit the metabolic potential of natural -product biosynthesis — i.e., combinatorial 
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biology (8), combinatorial biosynthesis (9,10), and combinatorial biocatalysis (11,12). 
Although these technologies are successfully expanding accessible chemical diversity, 
the financial hurdles of developing and commercializing newly discovered compounds 
remain. Sufficient quantities of pure compound must be available for development and 
clinical studies, and a final approved compound must be economically accessible to the 
consumer and profitable for the producer. The goal of this chapter is to discuss tech- 
nologies aimed at the improvement of process economics in the production of natural 
products; we will focus specifically on the application of evolutionary engineering tech- 
nologies for the improvement of fermentation-based bioprocesses. The theory and prac- 
tice of gene-shuffling technology shall be introduced, followed by commercially 
relevant examples of its application to the improvement of a specific gene in a native 
biosynthetic pathway, of engineered biosynthetic pathways, and of a whole fermenta- 
tion strain genome, each for the production of commercial natural products. 

1. 1. Economics of Production 

A comprehensive discussion of the economics of fermentation processes has been 
described previously (ref . 13, and references therein). Simply, the cost of goods (COG) 
for producing a fermented compound is primarily associated with its production in the 
fermenter and isolation from the resulting culture. These costs include raw materials, 
energy, capital investments, time, and labor, and are directly offset by the overall iso- 
lated yield of compound by the process. COG can thus be estimated as COG ($/kg) = 
(C f + Cj)/yP f , where C f is the cost associated with the fermentation, C ; is the cost asso- 
ciated with product isolation, P f is the mass of product produced by the fermentation, 
and y is the percent yield of the isolation process. Accordingly, the economics of pro- 
duction can be improved by either decreasing the costs associated with production and 
isolation (C f and Ci) or increasing the total process isolated yield (P f and/or y). Achiev- 
ing these economic goals is generally realized through iterations of process and bio- 
catalyst (strain) engineering. Process engineering is the manipulation of the physics 
and chemistry of the fermentation and isolation process (the environment to which the 
biological catalyst and its products are exposed), whereas strain engineering is the 
genetic manipulation of the biological catalyst to improve its performance under 
fermentation process conditions. The technologies described in this chapter are new 
tools to dramatically facilitate the strain-engineering aspect of fermentation process 
improvement. 

1.2. Strain Improvement 

In fermentation processes, a whole cell biological catalyst, such as a bacterium, fun- 
gus, or mammalian cell culture, is propagated in a medium composed of raw materials, 
which over time are converted to biomass, the product of interest, and byproducts. The 
total mass of product produced in the process is proportional to the mass of biocatalyst 
produced, the specific activity of the biocatalyst at different times throughout the fer- 
mentation, the time the biocatalyst is catalytically active, and the volume of the reactor. 
Since both the volume of the reactor and the time it is running are key contributors to 
the costs of the process, an important metric of process efficiency is the amount of 
product produced in a given volume over time, which is expressed as volumetric pro- 
ductivity (g'L'd 1 ). A process that runs for 10 d and results in 10 g/L of product has a 
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volumetric productivity of 1 gL _1 d _1 . An economically ideal process would be short, 
based on inexpensive raw materials, produce a large amount of product and little biom- 
ass or byproducts, and be run under conditions that support an equally inexpensive 
isolation process. Strain engineering is aimed at achieving this ideal through the genetic 
manipulation of the biocatalyst. 

The primary goal of strain engineering for the production of natural products is the 
improvement of volumetric productivity. Most natural strain isolates produce second- 
ary metabolites at very low levels (10-100 mg/L), while mature commercial strains 
can make significantly more than 10 g/L (14). There are in addition other performance 
criteria that may be targeted — for example, the purity of the products produced, ability 
to use inexpensive feedstocks, stability, low viscosity, and tolerance to optimal process 
conditions. In regards to purity, secondary metabolites often result from complex bio- 
synthetic pathways that produce not only the compound of interest but also structurally 
related "shunt products" that are difficult to remove in the isolation process. Decreas- 
ing the levels of these compounds in the fermentation can dramatically simplify the 
isolation process, result in an increase in isolation yield, and a corresponding decrease 
in the cost of the final product. 

Rationally engineering a commercially useful cellular phenotype is usually not 
simple. A great deal of information, genetic tools, and a strong model of the structure 
and function of the target cell are required. Most industrial organisms are significantly 
less well characterized then model academic systems (e.g., there are few biochemical 
pathway, genetic, or genomic data) and lack the technical prerequisites described above. 
This is especially true for emerging industrial organisms (such as those producing a 
new, promising natural product). This is not to say that excellent examples of success- 
ful metabolic engineering of industrial microorganisms do not exist. They do (15-19). 
However, they predominantly are for well-characterized systems. 

The primary approach to improving industrial organisms is classic strain improve- 
ment. This proven method of random mutagenesis and screening for improved strains 
remains the industry standard for commercial strain improvement, and has been thor- 
oughly reviewed (20-23). The beauty of this approach is its simplicity. It requires no 
molecular tools, metabolic models, or biochemical or genomic information, only an 
effective mutagen and an accurate screen for the desired phenotype. However robust, 
classic strain improvement is time and labor intensive, and can be the economic hurdle 
that limits commercial realization of a promising product or biological process. Even 
for well-characterized organisms, this approach remains the industrial method of choice 
for strain improvement. Indeed, this "black box" approach to genetic improvement is 
the foundation of evolutionary engineering. 

2. Evolutionary Engineering and DNA Shuffling 

Natural evolution is a recursive process of genetic diversification and functional 
selection (Fig. 1). Evolutionary engineering is a controlled process of evolution where 
both the genetic diversification and the functional selection are controlled in the lab. 
Evolutionary engineering has emerged as a complementary tool to rational design, and 
has proven to be one of the most reliable tools by which the function of biological 
systems can be purposefully manipulated. The approach does not rely on the applica- 
tion of structure-function relationships, but rather applies nature's own design algo- 





110 del Cardayre 

Diversification 
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Fig. 1. Evolution is a recursive process of genetic diversification and functional selection. 
"Evolutionary engineering" or "directed evolution" is an application of the evolutionary algo- 
rithm for the purposeful manipulation of biological systems in which the recursive cycles of 
genetic diversification and functional screening are controlled in the laboratory. 



rithm. Large populations of genetically diverse individuals are generated and then 
screened for a desired function, with those having acquired the desired function surviv- 
ing to parent the next population of diverse offspring (just as in classical strain im- 
provement). This recursive approach to biological systems engineering asks the 
question "which structure supports a desired function?" as opposed to "does this pre- 
conceived structure support a desired function?" In this way, the functions of complex 
biological systems are routinely tailored without relying on rational structural designs 
or simplified models. 

2. 1. Genetic Diversification 

Next to the classical breeding of plants and animals, classical strain improvement is 
perhaps the oldest application of evolutionary engineering. This process relies on re- 
cursive random mutagenesis of entire living genomes. In the late 1980s and early 1990s, 
the directed evolution of single genes emerged (24-26), in which genes were randomly 
mutated by error-prone polymerase chain reaction (PCR) or direct synthesis, and then 
screened for improved function. This very successful approach turned out to be a major 
step in the ability to routinely improve the function of biological macromolecules. 
However, this process of sequential random mutagenesis and screening, like classic 
strain improvement, lacks recombination, which is a key component and catalyst of the 
evolution algorithm. In nature, recombination, through sexual mating and horizontal 
gene transfer, facilitates both the amplification and the consolidation of the useful ge- 
netic information within a population. Recombination between members of a popula- 
tion allows the genetic potential of the entire population to be exploited in the evolution 
process, with useful mutations from individuals congregating and deleterious muta- 
tions being segregated over time. In essence, recombination facilitates information ex- 
change in a learning system. In the absence of recombination, these benefits are lost. 
Individuals carrying useful mutations within a population cannot benefit from each 
other (through breeding) and must evolve independently (randomly acquiring new 
mutations). Mutations arise linearly and may need to be rediscovered. Useful genetic 
diversity can be discarded, as only one or a few improved genetic variants from a popu- 
lation can advance to further rounds of diversification. Finally, deleterious mutations 
can accumulate over time, eroding the benefit of accumulated beneficial mutations. As 
a result, evolution in the absence of recombination can be significantly slower and less 
efficient (Fig. 2a). 
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In 1994, Stemmer introduced DNA-shuffling technology (27,28). DNA shuffling 
provides a means to breed a population of DNA fragments in vitro or in vivo (Fig. 2b). 
The shuffling reaction produces from a starting population of genes a combinatorial 
library of new genes containing permutations and combinations of the genetic diversity 
originating in the starting population (e.g., if a population of genes contain 10 random 
mutations that affect functional improvement, there are 2 10 = 1024 possible combina- 
tions of these mutations that could be generated through their shuffling). Unlike the 
results of classical breeding, the progeny of shuffled libraries can have genetic material 
from all of the parents that were shuffled (as opposed to only two in the case of classi- 
cal breeding). These shuffled libraries in general contain new genes that have signifi- 
cant functional improvements over the parental sequences. Because the library is not a 
library of random mutations, which primarily are deleterious or neutral in function, but 
rather a library of highly functional combinatorial mutants, improved genes can be 
identified from screening a relatively small sample of the library. It has since been 
demonstrated that DNA shuffling in combination with high-throughput screening 
(HTS) dramatically accelerates the process of evolutionary engineering and is appli- 
cable to a diversity of biological phenomena of both commercial and academic inter- 
est. 

DNA shuffling as a technology has significantly evolved since it was first reported 
in 1994. Typical approaches to gene shuffling now include multi-gene shuffling (i.e., 
the shuffling of homologous genes from different species) (29,30), synthetic shuffling 
(31), and in silico shuffling (32,33), all of which can be further enhanced using statis- 
tical and bioinformatic tools. Successful applications are numerous in both industry 
and academia, and include improved enzyme activity, selectivity, and stability, im- 
proved flux through metabolic pathways, and improved operators, vectors, antibodies, 
protein therapeutics, and vaccines. A recent advance in shuffling technology is its ap- 
plication to whole genomes (34,35). Genome shuffling, which shall be discussed in 
detail in Subheading 5.1., is an improved means of effecting classic strain improve- 
ment that incorporates recombination into the mutagenesis and screening algorithm, 
allowing the strain improvement process to proceed at an accelerated pace. 

2.2. Screening 

The two components of evolution are genetic diversification and functional selec- 
tion. The process of functional selection can be the greatest technical challenge of any 
evolutionary engineering effort, since in the lab "selection" generally is effected 
through a screen. If a screen does not measure the commercial performance criteria, it 
may identify only laboratory novelties, but never a truly improved variant that can be 
commercialized. As most HTS relies on the common 96-well plate format, it is almost 
impossible to develop a screen that replicates commercial process conditions. Accord- 
ingly, a great deal of industrial evolutionary engineering relies on quantitatively under- 
standing commercial process conditions and the desired performance criteria of the 
catalyst being evolved, and then devising methods to mimic these at small scale such 
that real performance can be measured. 

A screen has two components: the reaction that needs improvement, such as a 
biocatalytic conversion or fermentation, and the analysis of that reaction, such as the 
quantification of the product and its purity in the reaction mixture. The reaction itself is 
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Fig. 2. (A) Recombination is a catalyst of the evolutionary process. Asexual evolution is a 
recursive process of random mutagenesis and functional selection in which improved mutants 
must acquire new mutations randomly in a linear fashion, i.e., 10 rounds of mutagenesis and 
selection will result in about 10 new mutations. Sexual evolution incorporates recombination 
into this process and results in the exchange of information within a population, such that 
improved mutants within a population can acquire new useful mutations from each other. 
Recombination amplifies the useful genetic diversity of a population in combinatorial manner, 
i.e., if there are 10 useful mutations in a population there are 2 10 = 1024 combinations of those 
mutations that can result from their recombination. Recombination also harnesses the genetic 
diversity of a population by allowing mutations to become linked to each other and thereby 
distributed more broadly throughout a selected population. Useful mutations are combined in 
an exponential manner, i.e., 3 rounds of recombination can result in the accumulation of 2 3 = 8 
mutations. Finally, recombination provides a means for useful mutations to become segregated 
from deleterious mutations, which would be difficult to lose by random mutagenesis. Accord- 
ingly, sexual evolution proceeds faster than asexual evolution. (B) DNA shuffling combines 
the practices of classical breeding and molecular biology to allow one to breed a population of 
DNA molecules in vitro. In one shuffling format, genes are randomly fragmented and then 
reassembled in a primerless polymerase chain reaction. During each thermal cycle, single- 
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the most difficult to design, as it should mimic commercial conditions as closely as 
possible. An exact replication of the physical and chemical parameters under which a 
process is run should be scaled down as much as possible — in general, to <1 mL of 
reaction mixture. These reaction conditions must then be accurately reproduced for 
each of the genetic variants being compared, such that any variation in the measured 
productivity of a reaction reflects the performance of the catalyst, not a variation in the 
reaction conditions. Indeed, the variability of the system limits the ability to identify 
statistically relevant improvements. Analysis of the completed reactions is generally 
easier to solve, assuming that the right equipment is available. Most analytical methods 
are now amenable to high-throughput applications. UV/Vis and fluorescence spectros- 
copy are routinely used for the analysis of 96-well-plate-mediated reactions. Mass spec- 
troscopy has become a very sensitive high-throughput workhorse, with MS/MS 
providing very precise chemical data (36). Unfortunately, most chromatographic meth- 
ods, such as HPLC and gas chromatography (GC), remain only medium throughput 
(approx 200 samples per day per column) for most applications. Ninety-six parallel 
capillary electrophoresis, however, is now proving to be a powerful means of quickly 
separating and quantifying the components in complex mixtures, such as natural- 
product fermentation broths and mixtures of enantiomers (37). Once quantitative data 
are produced, however, rigorous statistical analyses should be applied to identify whether 
any genetic variants are truly improved (23). 

2.3. High-Throughput Fermentation 

In the case of fermentation strain improvement, generating a quality screen is a par- 
ticularly difficult challenge. Commercial fermenters can range from 1 to 200 m 3 , sup- 
porting cell growth to very high density. The media are complex and often contain 
solids and oils. Most fermentations are controlled at specific temperature, pH, and dis- 
solved oxygen, and may have controlled feeding of important raw materials and nutri- 
ents. The resulting fermentation broths are a complex mixture of compounds that 
generally require significant processing before pure compound is isolated. Tradition- 
ally, strain improvement has relied on small-scale fermentations carried out in shake 
flasks or test tubes and then analyzed chromatographically. However, these methods 
are now being adapted to 96-well format (34). Dispensing particulate medium contain- 
ing oils into 96-well plates is feasible using automated liquid handling, such as a 
Multimek (Beckman-Coulter) outfitted with wide-bore pipet tips (Robbins), which can 
uniformly dispense media containing large particles. Often buffers incorporated into 
the media can provide needed pH control without undesired effects on culture perfor- 



(Fig. 2. continued) stranded fragments from one gene hybridize to homologous regions of other 
genes and prime the DNA polymerization reaction, resulting in DNA fragments containing 
genetic material from at least two of the original genes. After numerous thermal cycles, com- 
plete chimeric genes are synthesized, with the final population representing a combinatorial 
library of the original starting genetic diversity. This library is then screened for new genes 
having desired properties. The process is then reiterated until the specific performance is 
achieved. 
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mance. Inoculation of seed cultures can be achieved through automated colony pick- 
ers, such as the Q-bot (Genetix, UK), transfer and assays can be accomplished using 
automated liquid-handling systems and robotics, and fermentations can be carried out 
in 96-, 48-, and 24-well plates vigorously agitated in controlled-environment shakers, 
which maintain uniform temperature and high humidity to prevent evaporation. The 
challenge is primarily for mycelial organisms, where high cell densities result in vis- 
cous cultures that require high oxygen transfer rates. Research to identify conditions 
for optimal delivery of oxygen has been described (38^40). In addition, new technolo- 
gies are emerging to support 96-well fermentations with controlled dO, pH, and tem- 
perature (41); however, these have not yet reached commercial launch. 

3. Gene Shuffling and Enzyme Evolution 

There are countless examples of the application of gene shuffling to the improve- 
ment of enzymes in the primary literature, and these are routinely reviewed (42,43). 
Industrial performance criteria for enzymes that have been improved by these methods 
include expression, stability, catalytic tolerance to diverse and extreme environments, 
catalytic activity, and substrate selectivity, including alterations in chemical, regio- 
chemical, and enantiomeric selectivity. Most literature examples describe the improve- 
ment of in vitro enzyme activity. However, for enzymes involved in the biosynthesis of 
a fermented natural product, these same characteristics expressed in vivo can signifi- 
cantly influence the volumetric productivity of a fermentation and isolation process 
(44). Targeting biosynthetic enzymes for evolutionary engineering is a method of 
rational metabolic engineering that requires knowledge of the biosynthetic pathway, 
limiting steps, genes encoding the targeted enzymes, and genetic tools for their ma- 
nipulation. Below are examples of how DNA shuffling was applied in this manner to 
improve the production of natural products. 

3.1. Gene Shuffling of aveC for Improved Commercial Production 
of Doramectin 

Avermectin and its analogs are biosynthesized by the actinomycete Streptomyces 
avermitilis and are today the most widely used drugs in animal health and agriculture, 
with current worldwide sales exceeding $1 billion. S. avermitilis normally produces 
eight distinct but closely related avermectins. Through strain improvement efforts via 
random mutagenesis and the use of exogenously supplied fatty acids, the avermectin 
analog doramectin (CHC-B1) was developed. Doramectin, which is sold commercially 
as Dectomax™, is co-produced during fermentation with the undesired analog CHC- 
B2. CHC-B1 and CHC-B2 are structural homologs, which differ only by the hydration 
of the C22-C23 methylene bond (Fig. 3), and are derived from avermectin Bl and B2. 
CHC-B2 must be removed through the isolation process. An economic improvement in 
the process would be achieved if the CHC-B1:CHC-B2 ratio were improved without 
compromise of the final CHC-B1 titer. 

aveC was identified as a gene that affected the ratio of avermectin Bl and B2 pro- 
duced in fermentations (45, 46). The gene was cloned by complementation from a clas- 
sically improved strain of S. avermitilis that demonstrated an altered B1:B2 ratio in 
fermentation broths. The aveC gene has no close homologs or known function at this 
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Fig. 3. Structure of avermectins. Avermectin Bl and B2 differ from each other only by the 
chemistry of the C22-C23 bond. Bl has a cis double bond, whereas B2 has this bond hydrated 
at the C23 position. Doramectin is derived from B 1 . Avermectin R = C 2 H 5 , doramectin R = 



time. S. avermitilis disrupted in aveC produces only background levels of avermectin. 
Stutzman-Engwall et al. investigated whether DNA shuffling of the aveC gene might 
result in new variants that would confer an ability to produce improved ratios of 
avermectins Bl and B2, and hence a corresponding improvement in the production of 
CHC-B1:CHC-B2 in commercial doramectin processes (36). 

The aveC gene was first randomly diversified by mutagenic PCR, and the gene vari- 
ants were cloned and expressed in S. avermitilis. Extracts from the variants, grown in 
high-throughput solid-phase fermentations containing authentic fermentation media 
components were then analyzed by an HTP MS/MS assay that quantified the total 
avermectin Bl and B2 produced. From this library, mutant aveC genes were identified 
that conferred upon S. avermitilis an improved B1:B2 ratio. Whereas the wild-type 
aveC gene resulted in a B1:B2 ratio of 1:1.6, an evolved aveC containing two muta- 
tions (D48E/A89T) resulted in a B 1 :B2 ratio of 1 :0.4, a fourfold improvement that also 
resulted in a 40% increase in Bl production under commercial conditions (Fig. 4). 

Additional evolution was pursued to further increase the B1:B2 ratio. Using the 
aveC (D48E/A89T) gene as backbone, three iterations of DNA semi-synthetic shuf- 
fling and HTS were carried out. From a population of significantly improved variants, 
an evolved aveC mutant containing 10 mutations was identified that resulted in a Bl :B2 
ratio of 1:0.07, a 23-fold improvement (Fig. 4) (47)\ The significance of this study is 
dramatic. A gene whose contribution to the biosynthesis of a natural product remains 
unknown, was engineered to effect a >20-fold improvement in the commercial perfor- 
mance criteria. This milestone in the engineering of natural-product biosynthetic path- 
ways emphasizes the industrial importance of evolutionary engineering and in 
particular DNA shuffling as an engineering tool. 
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Fig. 4. Improvements in doramectin process. Initial production strains of S. avermitilis pro- 
duced a CHC-B1:CHC-B2 ratio of 0.6:1. A new strain carrying an aveC gene improved by 
evolutionary engineering produced a B1:B2 ratio of 2.5:1, and has been commercialized. After 
three additional rounds of aveC shuffling, a new strain was generated that produced a ratio 
B1:B2 of 15:1, a 23-fold improvement (36,47). 



3.2. Gene Shuffling to Enhance Production of Shikimate 
From Escherichia coli 

Shikimic acid is an intermediate in the aromatic amino acid and vitamin pathway, a 
chemical intermediate in the synthesis of the anti-influenza drug Tamilfu®, and its bio- 
synthetic pathway contributes intermediates to a variety of natural products, such as 
rifamycin, rapamycin, and tacrolimus (FK506). The first committed step in shikimate 
biosynthesis in Escherichia coli is the feedback-regulated condensation of phospho- 
enolpyruvate (PEP) and D-erythrose-4-phosphate (E4P) by 3-deoxy-D-arabino- 
heptulosonate (DAHP) synthase. The shikimate pathway must compete with the 
PEP-dependent carbohydrate phosphotransferase (PTS) system for microbial transport 
and phosphorylation of glucose. Frost and colleagues hypothesized that improved flux 
through the shikimate pathway could be achieved if the condensation of pyruvate with 
E4P could be engineered to replace DAHP synthase as the first committed step in the 
shikimate pathway. To this end, Ran et al. demonstrated that 2-keto-3-deoxy-6- 
phosphogalactonate (KDPGal) aldolase could catalyze the condensation of pyruvate 
and E4P, and support the in vitro synthesis of 3-dehydroshikimic acid when incubated 
with pyruvate, E4P, 3-dehydroquinate synthase, and 3-dehydroquinate dehydratase. 
They then demonstrated that dgoA (which encodes KDPGal aldolase) from both E. coli 
and Klebsiella pneumoniae complemented in part E. coli strains having the genes 
encoding all isoforms of the DAHP synthase knocked out (48). 

To improve flux through this engineered shikimate pathway, the dgoA genes were 
improved by evolutionary engineering (48). Both the E. coli and the K. pneumonia 
dgoA genes were improved through two rounds of error-prone PCR mutagenesis fol- 
lowed by one round of DNA shuffling. Improved genes were identified by a nutritional 
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selection having increased stringency in each cycle of directed evolution (i.e., that 
required dgoA variants to support growth on progressively lower concentrations of 
shikimate pathway downstream products). The best-improved genes contained four to 
nine amino acid changes, demonstrated four- to eightfold increase in DAHP synthase 
activity and a 7- to 30-fold decrease in KDPGal aldolase activity relative the parental 
enzymes. To compare the in vivo flux through the shikimate pathway, the encoding 
constructs of the improved and parental genes were expressed in E. coll NR7 (E. coli 
KL3 with all DAHP synthase isozymes genetically inactivated). 3-Dehydroshikimic 
acid production from these strains was then monitored in controlled stirred-tank fer- 
mentations. While the K. pneumoniae dgoA gene supported only background produc- 
tion of 3-dehydroshikimate, its best progeny supported 8.3 g/L. Similarly, the E. coli 
dgoA gene supported 2 g/L, while its best progeny supported 12 g/L. This is an elegant 
example of the combination of rational metabolic engineering and evolutionary engi- 
neering to dramatically improve the flux to a natural product, and represents a new 
paradigm for overcoming PEP availability. 

3.3. Pathway Shuffling and Evolution 

An important goal in natural-product production is the engineering of new meta- 
bolic pathways into fermentation organisms. Often a desired compound is not pro- 
duced by a known organism, but a pathway comprised of host and recombinantly 
expressed genes can be constructed that enable the desired biosynthesis. Although this 
approach has met certain success at creating new strains making desirable compounds, 
often the compound is not produced at commercially viable levels. In these cases, it is 
likely that the engineered pathway contains the metabolic bottleneck, and the flux 
through that pathway needs to be improved. Again, evolutionary engineering is a prag- 
matic approach to improving performance. The DNA encoding a part or the entire 
metabolic pathway can be subjected to DNA shuffling, and host organisms expressing 
the pathway variants can then be screened for improved compound production. Evolu- 
tion in this context allows the individual genes to become better expressed, the en- 
coded recombinant enzymes to become optimized to function in the new host 
environment in a more balanced fashion, and catalytically limiting enzymatic steps to 
be improved. This approach has been successfully demonstrated in the adaptation and 
improvement of several metabolic pathways, including the arsenate detoxification path- 
way from Staphylococcus aureus expressed in E. coli (49), an engineered pathway for 
production of adipyl-7-amino deacetoxycephalosporanic acid (Tobin et al., unpublished 
results), the mevalonic acid pathway from Saccharomyces cerevisiae expressed in E. 
coli for improved fermentative terpene production (Chatterjee et al., unpublished data), 
and for the production of an intermediate in a new synthesis of the flavor compound 
strawberry furanone (50). 

4. Genome Shuffling and Organism Evolution 

Despite successes in the rational and targeted engineering of fermentation strains, 
the primary means of improving even well-characterized and mature industrial organ- 
isms remains classical strain improvement. This is because most whole-cell pheno- 
types are by nature complex, and it is difficult to model or identify which genes in the 
dynamic cellular machine may be limiting. By evolving an entire genome, no assump- 
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tions as to what is limiting are made, and an organism can be improved through the 
evolution of its entire genome. As described previously, classical strain improvement 
suffers from being time and labor intensive. In addition, it is an asexual process, which 
has the associated limitations of accumulating beneficial and difficult to lose deleteri- 
ous mutations in a linear fashion. In 2001, Zhang et al. introduced whole-genome shuf- 
fling (34), By incorporating recombination into the recursive cycles of mutagenesis 
and screening of classical strain improvement, Zhang et al. were able to overcome 
some of the limitations of the classical method and dramatically accelerate the rate of 
strain improvement. 

4. 1. Genome Shuffling 

Genome shuffling combines the advantage of multiparental crossing that is allowed 
by DNA shuffling with the recombination of entire genomes normally associated with 
conventional breeding. While in vitro recombination of genes (<10 kb) is routine, it is 
not for whole genomes (»1 Mb). In vivo methods for effecting recombination prima- 
rily are based on natural mechanisms and result in low recombination efficiencies (<10~ 5 ). 
Hopwood (57) and Baltz (52) described the application of protoplast fusion technology 
as an efficient means of obtaining efficient recombination (>10%) between strains of 
Streptomyces. Hopwood and Wright later demonstrated that a single protoplast fusion 
could also result in recombination between up to four strains of Streptomyces 
coelicolor, although with very low efficiency (approx 10" 6 ) (53). It was proposed that 
this tool could be used to breed cell lines resulting from divergent lineages in a strain- 
improvement program (54,55). However, the technology never became popular in this 
regard, likely because a single fusion between divergent lines resulted in new strains 
that performed poorly relative to their parents. Indeed this is what would be expected, 
as the beneficial mutations from divergent cell lines would be diluted as a result of a 
single cross between distantly related strains of a lineage — i.e., if two strains, each 
having 10 unique mutations, were crossed, the average recombinant would still have 
10 mutations, but 5 mutations would originate from each line. The majority of muta- 
tions would have to be complementary for improvement to be expected from a signifi- 
cant portion of the population resulting from a cross. Zhang et al. applied the fact that 
shuffling is a recursive process of pair-wise recombination; in vitro shuffling is a pro- 
cess of recursive pairwise recombination between DNA molecules that occurs during 
each thermal cycle of the shuffling reaction (27). Taking advantage of the high pair- 
wise recombination efficiencies of protoplast fusion, they demonstrated that recur- 
sive protoplast fusion (Fig. 5) of a population of different cells resulted in the 
shuffling of their genomes. This was verified by recursively fusing the same four 
multiply marked strains of S. coelicolor used by Hopwood and Wright (50). The 
recursively fused population, relative to the singly fused population, had a 10 5 -fold 
increase in recombinants having genetic markers from the four original parents (34). 
This significant increase in "complex progeny" suggested that one could shuffle a 
population of genetically diverse improved strains, such as improved siblings from a 
classical improvement program, and, through screening a small population, identify 
improved strains containing mutations from many of the parental breeding stock. An 
example of the successful application of this approach to strain improvement is described 
in Subheading 5.2. 
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Fig. 5. Genome shuffling by recursive protoplast fusion. A population of cells to be shuffled 
is identified, and protoplasts from each strain are prepared. These are then combined and fused 
as a mixed population and allowed to regenerate. In general, 10% of the population of regener- 
ated cells are pair-wise recombinants, i.e., they have genetic information from any two parents; 
but very few cells have genetic information from three or more parents. Recursive fusion of the 
regenerated protoplasts allow the 10% recombinants from each round to recombine with differ- 
ent parents or with each other, such that a population regenerated from recursive fusions can 
contain >2% cells that have genetic information from any four parents, hence representing a 
genome-shuffled library (34). When applied to strain development, the starting population of 
cells would be strains that each have some desired but genetically distinct property. 



4.2. Improved Production of Tylosin by Genome Shuffling 
of Streptomyces fradiae 

Tylosin (Fig. 6) is a complex polyketide antibiotic sold for veterinary health and 
nutrition that is isolated from the fermentation broths of S. fradiae. The biosynthesis of 
tylosin requires a biosynthetic cluster of approx 100 kb in addition to numerous genes 
distributed throughout the S. fradiae genome. Eli Lilly initiated in the early 1960s an 
extensive strain improvement program to improve tylosin production from S. fradiae. 
The program began with strain SF1, a stable derivative of the natural isolate obtained 
through natural selection. This strain was taken through 20 rounds of classical strain 
improvement (CSI) for improved tylosin production, resulting in the commercial pro- 
duction strain SF21. 

Zhang et al. employed this system to demonstrate that genome shuffling could a- 
ccelerate the strain-improvement process. They propogated 22,000 MNNG-induced 
mutants of SF1 in 96-well two-stage mini-fermentations. The mutants were screened 
for improved tylosin production using a two-tier spectrophotometric and HPLC assay. 
Eleven strains were identified and used as breeding stock for genome shuffling by 
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recursive protoplast fusion (Fig. 5). After two rounds of genome shuffling and screen- 
ing only 2000 mutants, the group identified new shuffled strains that produced tylosin 
at levels equivalent to SF21 in shake-flask cultures (Figs. 7, 8). A molecular and mor- 
phological comparison of GS1 and GS2 to SF1 and SF21 demonstrated that GS1 and 
GS2 were morphologically similar to SF1, but shared a functionally similar but struc- 
turally distinct mutation in a regulatory gene (tylQ) shared by SF21. This study demon- 
strated that the shuffling of the improved population resulting from one round of CSI 
could result in strains that performed as well as those resulting from 20 rounds of CSI. 
As a practical comparison, this represents the difference between 20 years and over a 
million screens for CSI as compared to 1 yr and less than 25,000 screens for genome 
shuffling (Fig. 8). The study also demonstrated that the diverse mutations discovered 
early in a strain-improvement program are important, and that only a few are actually 
required to result in significant improvement. In this example, the genome-shuffled 
strains would have had no more than 1 1 mutations (and likely less). Indeed, this may 
explain the similar morphology of SF1 and the shuffled strains GS1 and GS2 as com- 
pared to the much smaller SF21 (Fig. 8). 

This study demonstrates that commercially significant strain improvement can be 
achieved in a single year, a timeline that significantly decreases the resource and eco- 
nomic burden normally associated with strain improvement. From the perspective of 
developing new natural products, this technology may decrease the titer hurdle that 
developing new strains must face to be commercialized. Regarding the further im- 
provement of mature strains, it suggests that the shuffling of archived strains, such as 
those divergent strains from a lineage that were not ultimately pursued, may be a pow- 
erful means of taking advantage of previous screening efforts and accessible diversity 
to generate new and significantly improved strains. Indeed genome shuffling may 
enable the scenario originally proposed by Hopwood and Chater (54), in which the 
mutations from divergent lineages could be combinatorially explored through proto- 
plast fusion. 

5. Summary 

Natural products remain an important and lucrative source of chemical and func- 
tional diversity. New technologies aimed at further harnessing the potential of these 
compounds are still at their infancy, and their promise for generating useful new mol- 
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Fig. 7. Genome shuffling of S. fradiae for rapid improvement of tylosin production, from 
Zhang et al. (34). (A) A naive strain of S. fradiae, SF1, was mutagenized with NTG, and 22,000 
derived mutants were screened by high-throughput fermentation (300 pL) for improved tylosin 
production. Eleven of the improved strains identified were recombined by one round of ge- 
nome shuffling (Fig. 5), and 1000 individuals from the shuffled library were screened for fur- 
ther improvements. Seven new strains that demonstrated significant improvements over the 
NTG mutants were then recombined by another round of genome shuffling. Screening of this 
population identified another seven strains that had even further improvements. These sup- 
ported tylosin production in the small-scale fermentation that was superior to a commercial 
production strain, SF21, that was derived from more than 20 yr of classical strain improvement. 
(B) Relative tylosin production in shake-flask fermentations by strains from the second round 
of genome shuffling, GS1 and GS2, as compared to SF1 and SF21. 



ecules remains. However, new natural products will continue to face the economic 
hurdles and impatient timelines of the pharmaceutical industry. The challenge to pro- 
duce significant quantities of compound quickly for development and clinical studies, 
and to economically produce the final product, remains the responsibility of strain and 
process engineers. Evolutionary engineering is now a proven and robust tool for the 
purposeful manipulation of biological systems, and provides an important new tool for 
the strain engineer. Gene shuffling in combination with high-throughput screening is 
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Fig. 8. Comparison of S. fradiae improvement by genome shuffling and classical strain 
improvement. SF21 was produced from strain SF1 by 20 rounds of classic strain improvement 
involving over a million screens and 20 yr of effort. GS1 and GS2 were also generated from 
SF1 by one round of classic strain improvement and two rounds of genome shuffling involving 
24,000 screens and 1 year. Colony morphologies of SF1, SF21, and GS1 highlight the differ- 
ence in mutational loads of SF21 and GS1 relative to SF1. 



arguably the most effective means of achieving successful and efficient evolutionary 
engineering of genes, enzymes, metabolic pathways, and whole genomes. The technol- 
ogy has already realized notable commercial success in the improvement of estab- 
lished natural-product-producing microorganisms. Might this technology facilitate a 
resurgence of natural products in the drug discovery and development process? Hope- 
fully so. 
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Summary 

Since the early 1940s, the search for agents that may treat or ameliorate the scourge of cancer 
has involved all aspects of chemistry and pharmacology. Throughout these years, natural products 
have played an extremely important role as first the major source of drugs used for direct treatment, 
as scaffolds upon which chemists would practice their skill, and now as modulators of specific cellular 
pathways in the tumor cell. Even today, over 60% of the 140 plus agents currently available in Western 
medicine can trace their provenance to a natural-product source. 

Key Words: Natural products; cancer drug discovery. 

1. Introduction 

The original antitumor drugs that went into clinical usage before and after WWII 
were synthetic agents that were originally based upon the so-called nitrogen mustards 
used as blister gases during WWI, and effectively acted as acylating agents modifying 
the bases within the DNA helices of both normal and cancerous cells. 

Following WWII and the initial investigations of the microbial world for antibiotics, 
came the realization that a significant number of microbial products with nominal anti- 
biotic activity were also effective against eukaryotic cells from both fungi and mam- 
mals. In addition, there was some anecdotal evidence that plant secondary metabolites 
also demonstrated activity against mammalian cells, including tumor cells. Thus, from 
roughly the early 1950s to date, the systematic investigation of plant and microbial 
products for their potential as antitumor agents has been carried out worldwide, with a 
large amount of the funding coming from the US National Cancer Institute in one form 
or another. 

To put these results into perspective, of the 140-plus drugs that have been approved 
worldwide from the early 1940s through the end of 2002 for the treatment of the mani- 
fold diseases that fall under the generic heading of cancer, the majority are "other than 
true synthetics." What this statement means is that of the 126 small molecules identi- 
fied by excluding biologicals, vaccines, and hematological/immunological stimulants, 
the number of nonsynthetic agents is either 77 (62%) or 85 (67%), depending upon the 
use of definitions. If one uses the simple definitions listed in Table 1 (i.e., N, ND, and 
S*) then the lesser figure applies, whereas if the "NM" category is included, the greater 
figure applies. 

From: Natural Products: Drug Discovery and Therapeutic Medicine 
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Table 1 

Definitions of Drug Sources 

Major categorization 

"N" Natural product. 

"ND" Derived from a natural product and is usually a semi-synthetic modification. 
"S" Totally synthetic drug, often found by random screening/modification of an exist- 
ing agent. 
"S*" Made by total synthesis, but the pharmacophore is/was from a natural product. 

Minor categorization 

"NM" Natural product mimic 



In Table 2, in addition to showing the formal designation as to pharmacophoric 
source, we also show the actual biological source (plant or microbe) of the "other than 
synthetic molecules." 

2. Prokaryotes (Other Than Marine-Derived) 
2. 1. Approved Agents 

If one inspects Table 2, then the initial important involvement of the formal micro- 
bial world in cancer chemotherapy can be seen, with 21 of 126 (or 17%) of the formal 
natural products/natural product-derived categories from a chemical-source perspec- 
tive falling into this biological category. 

The first of the microbial compounds to be used systematically was named generi- 
cally as "Dactinomycin." Chemically, it is named D-actinomycin C 1; and the common 
usage is actinomycin D (Structure 1). This was first introduced in the early 1960s and 
has been the subject of very significant chemical synthetic and semisynthetic programs 
in many parts of the world since its introduction. However, to date, no derivatives have 
been approved for clinical use outside of experimental protocols. In recent years, there 
has been a series of reports that actinomycin D has activities related to perturbation of 
the signal transduction cascade(s) distinct from its well-known intercalating ability, 
and it will be interesting to see whether these activities rejuvenate interest in this class 
of molecules (1-5). 

One of the most important classes of compounds derived from the order Actinomyc- 
etales (and this is the formal source of most microbial-derived materials) is that known 
generically as the anthracyclines, with one of the most useful being daunorubicin (Struc- 
ture 2) and its derivative doxorubicin (adriamycin) (Structure 3), which even today is 
still a major component of the treatment regimen for breast cancer. Although there 
have been many similar molecules isolated and described in the literature, it is doxoru- 
bicin and its more modern derivatives such as epirubicin (Structure 4), pirirubicin 
(Structure 5), idarubicin (Structure 6), and, more recently, valrubicin (Structure 7) that 
have been approved for cancer treatment. The mechanism of action (MOA) of these 
molecules, aside from their formal identification as intercalators into the DNA helix, is 
now known to be inhibition of topoisomerase II, one of the important enzymes in the 
replication pathway of DNA during cell cycling. 
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Table 2 

All Small Molecule Anticancer Drugs (1940s-2002) 

(Organized Alphabetically by Generic Name Within Chemical Source) 



Generic name 


Year introduced 


Chem-source 


Bio-source 


aclarubicin 


1981 


N 


M 


actinomycin D 


Pre- 1981 


N 


M 


angiotensin II 


1994 


N 


A 


arglabin 


1999 


N 


M 


asparaginase 


Pre- 1981 


N 


M 


BEC 


1989 


N 


P 


bleomycin 


Pre- 1981 


N 


M 


daunomycin 


Pre- 1981 


N 


M 


doxorubicin 


Pre- 1981 


N 


M 


masoprocol 


1992 


N 


P 


mithramycin 


Pre- 1981 


N 


M 


mitomycin C 


Pre- 1981 


N 


M 


paclitaxel 


1993 


N 


P 


pentostatin 


1992 


N 


M 


peplomycin 


1981 


N 


M 


solamargine 


1987 


N 


P 


streptozocin 


Pre- 1981 


N 


M 


testosterone 


Pre- 1981 


N 


A 


vinblastine 


Pre- 1981 


N 


P 


vincristine 


Pre- 1981 


N 


P 


alitretinoin 


1999 


ND 


P 


amrubicin hydrochloride 


2002 


ND 


M 


cladribine" 


1993 


ND 


M* 


cytarabine ocfosfate" 


1993 


ND 


M* 


docetaxel 


1995 


ND 


P 


dromostanolone 


Pre- 1981 


ND 


A 


elliptinium acetate 


1983 


ND 


P 


epirubicin hydrochloride 


1984 


ND 


M 


estramustine 


Pre- 1981 


ND 


A 


ethinyl estradiol 


Pre- 1981 


ND 


A 


etoposide 


Pre- 1981 


ND 


P 


etoposide phosphate^ 


1996 


ND 


P 


exemestane 


1999 


ND 


A 


fluoxymesterone 


Pre- 1981 


ND 


A 


formestane 


1993 


ND 


A 


fulvestrant 


2002 


ND 


A 


gemtuzumab ozogamicin 


2000 


ND 


M 


hydroxyprogesterone 


Pre- 1981 


ND 


A 


idarubicin hydrochloride 


1990 


ND 


M 


irinotecan hydrochloride 


1994 


ND 


P 


medroxyprogesterone acetate 


Pre- 1981 


ND 


A 


megesterol acetate 


Pre- 1981 


ND 


A 


methylprednisolone 


Pre- 1981 


ND 


A 
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Table 2 (Continued) 



Generic name 


Year introduced 


Chem-source 


Bio-source 


methyltestosterone 


Pre-1981 


ND 


A 


miltefosine 


1993 


ND 


A 


mitobronitol 


Pre-1981 


ND 


A 


pirarubicin 


1988 


ND 


M 


prednisolone 


Pre-1981 


ND 


A 


prednisone 


Pre-1981 


ND 


A 


teniposide 


Pre-1981 


ND 


P 


testolactone 


Pre-1981 


ND 


A 


topotecan hydrochloride 


1996 


ND 


P 


triamcinolone 


Pre-1981 


ND 


A 


triptorelin 


1986 


ND 


A 


valrubicin 


1999 


ND 


M 


vindesine 


Pre-1981 


ND 


P 


vinorelbine 


1989 


ND 


P 


zinostatin stimalamer 


1994 


ND 


M 


aminoglutethimide 


1981 


S 




amsacrine 


1987 


S 




arsenic trioxide 


2000 


s 




bisantrene hydrochloride 


1990 


s 




busulfan 


Pre-1981 


s 




camostat mesylate 


1985 


s 




carboplatin 


1986 


s 




carmustine 


Pre-1981 


s 




chlorambucil 


Pre-1981 


s 




chlortrianisene 


Pre-1981 


s 




cis-diamminedichloroplatinum 


Pre-1981 


s 




cyclophosphamide 


Pre-1981 


s 




dacarbazine 


Pre-1981 


s 




diethylstilbestrol 


Pre-1981 


s 




flutamide 


1983 


s 




fotemustine 


1989 


s 




heptaplatin /SK-2053R 


1999 


s 




hexamethylmelamine 


Pre-1981 


s 




hydroxyurea 


Pre-1981 


s 




ifosfamide 


Pre-1981 


s 




levamisole 


Pre-1981 


s 




lobaplatin 


1998 


s 




lomustine 


Pre-1981 


s 




lonidamine 


1987 


s 




mechlorethanamine 


Pre-1981 


s 




melphalan 


Pre-1981 


s 




mitotane 


Pre-1981 


s 




mustine hydrochloride 


Pre-1981 


s 




nedaplatin 


1995 


s 




nilutamide 


1987 


s 
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Table 2 (Continued) 



Generic name 


Year introduced 


Chem-source 


Bio-source 


nimustine hydrochloride 


Pre- 1981 


S 




oxaliplatin 


1996 


S 




pipobroman 


Pre- 1981 


s 




porfimer sodium 


1993 


s 




procarbazine 


Pre- 1981 


s 




ranimustine 


1987 


s 




sobuzoxane 


1994 


s 




thiotepa 


Pre- 1981 


s 




triethylenemelamine 


Pre- 1981 


s 




uracil mustard 


Pre- 1981 


s 




zoledronic acid 


2000 


s 




aminogluethimide 


Pre- 1981 


s* 


P 


capecitabine 


1998 


s* 


M* 


carmofur 


1981 


s* 


M* 


cytosine arabinoside 


Pre- 1981 


s* 


M* 


doxifl uridine 


1987 


s* 


M* 


enocitabine 


1983 


s* 


M* 


floxuridine 


Pre- 1981 


s* 


M* 


fludarabine phosphate 


1991 


s* 


M* 


fluorouracil 


Pre- 1981 


s* 


M* 


gemcitabine hydrochloride 


1995 


s* 


M* 


goserelin acetate 


Pre- 1981 


s* 


A 


leuprolide 


Pre- 1981 


s* 


A 


mercaptopurine 


Pre- 1981 


s* 


M* 


methotrexate 


Pre- 1981 


s* 


M 


mitoxantrone hydrochloride 


1984 


s* 


M 


tamoxifen 


Pre- 1981 


s* 


A 


thioguanine 


Pre- 1981 


s* 


M* 


bexarotene 


2000 


S*/NM 


P 


raltitrexed 


1996 


S*/NM 


M 


temozolomide 


1999 


S*/NM 


M 


anastrozole 


1995 


S/NM 




bicalutamide 


1995 


S/NM 




camostat mesylate 


1985 


S/NM 




fadrozole hydrochloride 


1995 


S/NM 




gefitinib 


2002 


S/NM 




imatinib mesilate 


2001 


S/NM 




letrazole 


1996 


S/NM 




toremifene 


1989 


S/NM 





"Marine involvement, see text. 

^Prodrug (not counted). 

M = Microbe; P = Plant; A = Animal; M* = Marine involvement (nucleoside). 

N, ND, S, S* S*/NM, S/NM; see Table 1 in text. 
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That there is still potential for doxorubicin, in spite of its cardiotoxicity, is demon- 
strated by the number of delivery techniques that have been either tested in clinical 
trials, are currently in clinical trials, or have been approved for use as methods of deliv- 
ery of this agent. These have included monoclonal antibodies bearing a doxorubicin 
warhead directed towards a particular epitope (predominately in leukemia treatment) 
and liposomal delivery systems involving both conventional (one is in clinical use in 
the USA) and "stealth" liposomes containing doxorubicin. In addition, doxorubicin is a 
favorite warhead for polymeric delivery systems. A group that has shown the utility of 
this approach is that of Duncan, with the reported clinical trials of PK1 and PK2 under 
the auspices of the United Kingdom cancer research consortia (6). 

Another series of extremely important molecules, also from the Actinomycetales , 
are the family of glycopeptolide antibiotics known as the bleomycins (Structure 8) 
(particularly A 2 ; Blenoxane®) and, initially, the closely related structural class, the 
phleomycins. These molecules were originally reported by Umezawa's group at the 
Institute of Microbial Chemistry in Tokyo, and developed as antitumor agents by 
Bristol-Myers. Their "original" MOA was elucidated by Hecht et al. (MIT and the 
University of Virginia), who demonstrated that a metal ion (Cu 2+ or Fe 2+ ) was required 
to activate the oxidative breakage of the DNA helix, once binding to the helices had 
occurred. For most of the years since the introduction of Blenoxane® into clinical use, 
the primary target was thought to be DNA. However, in a recent series of investiga- 
tions using both bleomycin and synthetic modifications of the molecule, Hecht has 
discovered that RNA interacts with the bleomycins at much lower concentrations than 
those required for any effect on DNA, thus opening up the possibility that the major 
site of action is, in fact, inhibition of t-RNA, not DNA (Hecht, personal communica- 
tion). Irrespective of their exact mechanism, these agents are major products in the 
clinician's armamentarium. 

Other microbial products that are still in use include the mitosanes, such as mitomy- 
cin C, the glycosylated anthracenone, mithramycin, streptozotocin, and pentostatin, 
and, most importantly, calicheamicin. The latter compound (Structure 9) is possibly 
the most potent antitumor compound to be approved for clinical use. It is a compound 
with in vitro activity at the sub-picomolar level, and for a significant number of years, 
it languished as it was just too toxic to pursue, in spite of its exquisite activity (7). It 
was also the progenitor of a new chemical class, the endiynes, which, on activation, 
undergo an unprecedented rearrangement and interaction with DNA to produce double- 
stranded DNA cleavage. The actual in vivo MOA of these molecules is still open to 
debate, but modern techniques are now being employed in elucidating the possible 
pathways involved (8). In 2000, Wyeth-Ayerst (originally Lederle, now Wyeth) gained 
approval for Mylotarg®, where the compound is linked to a specific monoclonal anti- 
body that is directed against chronic myologenous leukemia (9). 

2.2. Microbial Agents in Clinical Trials 

In addition to the agents listed above, there are some extremely interesting structural 
classes of agents isolated from microbes, or modified from such agents, in various 
phases of clinical trials. 

There are perhaps two major structural types that evince significant interest on the 
part of both chemists and cell biologists. These are the structures based on the 
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epothilones (Structure 10), from the extremely prolific Myxomycetales, and the 
indolocarboxazoles such as staurosporine (Structure 11), from the Actinomycetales. 

2.2.1. Epothilones 

The epothilones are of great interest as potential antitumor agents due to their MOA 
being the same as that of paclitaxel (vide infra), though having at first glimpse, quite a 
different topology. Molecular modeling, however, has shown that there are significant 
common structural features in the two basic molecules (10-12). Originally, the 
epothilones were difficult to obtain in any quantity, and a significant amount of time 
and effort was spent by both academics and industry in efforts to synthesize both 
epothilones A and B and their precursors, C and D. The major impetus behind all of 
these studies was the realization that the epothilones were active against paclitaxel- 
resistant cell lines (13). Bristol-Myers also synthesized the aza-derivative (Structure 12) 
and currently have it and epothilone B in phase II clinical trials. 
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During this time period, the group at Kosan licensed the work of Danishevsky on 
epothilone D, which he had synthesized at Memorial Sloan-Kettering (MSK), and 
cloned and expressed the polyketide gene cluster that produced epothilones A/B. 
Removal of the terminal gene for the P 450 enzyme that epoxidized the macrolidic 
double bond in the precursor epothilones C/D, and transfer to a different host, pro- 
duced crystalline epothilone D from a 100-L fermentation (14). This material is now 
undergoing phase II clinical trials in conjunction with MSK and Roche. There have been 
significant efforts expended on the total synthesis of this class of compounds (15,16). 

2.2.2. Indolocarbazoles 

The indolocarboxazoles first came into prominence with the identification of 
staurosporine and its simple derivative UCN-01 (Structure 13) as potential inhibitors 
of components of the eukaryotic cell cycle and of phosphokinase C (17). The differ- 
ences in the potential mechanisms of action of staurosporine and simple derivatives vs 
synthetic analogs were shown by a comparison with the synthetic compound, SB-21 8078 
(Structure 14). Although not a natural product, the compound SB-21 8078 recently 
reported by workers (18) at SmithKline Beecham (now Glaxo SmithKline) is defi- 
nitely derived from the indolocarbazole base structure; a reduced furan ring replaces 
the substituted pyranose sugar found in staurosporine, and the maleimide ring of the 
didemnimides replaces the reduced pyranone of staurosporine. This compound demon- 
strates at least a 15-fold selectivity for Chkl inhibition vs Cdkl (aka cdc2), and a 65- 
fold selectivity over PKC (IC 50 values of 15, 250, and 1000 nM respectively). In 
contrast, staurosporine is active against all three kinases at the 5-8 nM level, and UCN- 
01 has a 15-fold selectivity vs Cdkl, but has comparable activity against PKC (IC 50 
values of 7, 100, and 4 nM respectively). 

These workers also confirmed the findings of Shao et al. (19) as to the synergistic 
effect of UCN-01 on camptothecin cytotoxicity, and demonstrated that SB-218078 had 
comparable activity, implying that the mechanism of this abrogation with UCN-01 is 
due to Chkl inhibition. Concomitantly with this work, Graves et al. (20) reported that 
UCN-01 inhibits Chkl, thus causing the phosphatase Cdc25c to be dephosphorylated, 
and hence allowing activation of Cdkl (Cdc2) and abrogating the G2 checkpoint. 

Currently UCN-01 is still in phase I clinical trials. For further discussions as to the 
possibilities of this basic structural type as inhibitors of the cell cycle, the relatively 
recent review of Newman et al. (21) should be consulted, together with the review of 
Dancey and Sausville (22) on the potential of protein kinase inhibitors in general as 
candidate antitumor agents. 

The compounds related to the cytotoxin rebeccamycin (Structure 15) are extremely 
interesting from a mechanistic aspect, as through relatively simple modifications of the 
base indolo-carbazole, one can generate molecules that have both topoisomerase I and 
topoisomerase II inhibitory activity. The findings have led to the synthesis of water- 
soluble analogs, such as NB-506 (Structure 16) and second-generation products, all of 
which show excellent preclinical in vivo activity. The topoisomerase I activity of this 
class of compounds has been extensively studied (23), and one of the most interesting 
discoveries in this structural class has been the simple analog R-3 (Structure 17). This 
compound was first reported as a topoisomerase I inhibitor in 1997 (24), and not only 
inhibits topoisomerase I but also completely inhibits the phosphorylation of SF2/ASF, 
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a member of the SR protein family, in the absence of DNA. This compound is the only 
compound so far reported that inhibits the protein kinase activity of topoisomerase I, 
an activity first reported in 1996 (25) and reviewed in 1997 (26). A derivative (Struc- 
ture 18) whose generic name is edotecarin, though previously known as J-107088, is 
currently in phase III trials with Pfizer for therapy of brain carcinomas. If it is ulti- 
mately commercialized, it will be the first non-camptothecin-based topoisomerase I 
inhibitor. 

Simple alteration of the substitution pattern around the indolocarbazole moiety of 
rebeccamycin gives a molecule known as BMS-27557 or NSC 655649 (Structure 19), 
which now has topoisomerase II rather than topoisomerase I selectivity. This com- 
pound is now in phase II trials with Exelixis for a variety of carcinomas. 

2.2.3. Geldanamycin Derivatives 

It is noteworthy that the first signal transduction modulator, other than a formal Cdk 
or PKC inhibitor, to enter clinical trials is a microbial product. Thus 17-allylamino- 
geldanamycin (17-AAG) (Structure 20) entered phase I trials with the NCI and Kosan 
(the licensee) in 2001. This compound binds at the major ATP-binding site of the pro- 
tein chaperone HSP-90. Although now it is known not to be suitable for use a single 
agent in humans, it appears to be effective when used in combination with cytotoxins 
such as paclitaxel. Very recently (27), it was shown that the HSP-90 of BT474 breast 
tumor cells binds 17-AAG much more strongly (approx 100-fold higher affinity) than 
that in "normal" cells (human renal epithelial cells and dermal fibroblasts). That there 
are further avenues to explore in the inhibition of the protein chaperones can be seen 
from the recent publications of Neckers and Ivy (28) on HSP90 and its potential, and of 
Beliakoff et al. (29) on the potential for use of HSP90 inhibitors in dealing with hor- 
mone-refractory breast cancers. 

3. The Plant Kingdom 
3. 1. Approved Agents 

Plants have a long history of use in the treatment of cancer (30), though many of the 
claims for the efficacy of such treatment should be viewed with some skepticism because 
cancer, as a specific disease entity, is likely to be poorly defined in terms of folklore and 
traditional medicine (31). The first agents to advance into clinical use were the so-called 
vinca alkaloids, vinblastine and vincristine (Structures 21, 22), isolated from the Mada- 
gascar periwinkle, Catharanthus roseus, which was used by various cultures for the 
treatment of diabetes. These drugs were first discovered during an investigation of the 
plant as a source of potential oral hypoglycemic agents (31). What is also of interest is 
that vincristine is in fact a heterologous dimer of two alkaloids that apart are not active 
as cytotoxins, but chemically combined, the compound acts as a tubulin interactive 
agent. Recently, two semisynthetic analogs of vincristine, vinorelbine and vindesine, 
were approved for clinical use (32). 

Podophyllotoxin was isolated as the active antitumor agent from the roots of various 
species of the genus Podophyllum. These plants possessed a long history of medicinal 
use by early American and Asian cultures, including the treatment of skin cancers and 
warts (31). Although podophyllotoxin is still used as a topical treatment for warts (USP 
formulary), it was too toxic for use as an injectable antitumor agent. Following exten- 
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sive semisynthetic modifications of the isomeric natural product, epipodophyllotoxin, 
by Sandoz (now part of Novartis), two clinically approved agents, etoposide and 
teniposide (Structures 23, 24), were developed. 

More recent additions to the armamentarium of plant-derived chemotherapeutic 
agents are the taxanes and camptothecins. Taxol® (Structure 25) (33) (generic name 
paclitaxel) was initially isolated from the bark of Taxus brevifolia, collected in Wash- 
ington State as part of a random collection program by the U.S. Department of Agricul- 
ture for the National Cancer Institute (NCI) (34). The use of various parts of T. 
brevifolia and other Taxus species (e.g., canadensis, baccata) by several Native Ameri- 
can tribes for the treatment of some non-cancerous conditions has been reported (30), 
while the leaves of T. baccata are used in the traditional Asiatic Indian (Ayurvedic) 
medicine system (35), with one reported use in the treatment of "cancer" (30). 
Paclitaxel, along with several key precursors (the baccatins), occurs in the leaves/ 
needles of various Taxus species, and the ready semi-synthetic conversion of the rela- 
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tively abundant baccatins to paclitaxel, as well as to active paclitaxel analogs, such as 
docetaxel (36), has provided a major, renewable natural source of this important class 
of drugs. 

Paclitaxel was a pivotal discovery from a mechanistic aspect in the treatment of 
cancer, as it was the first compound identified that caused the stabilization of microtu- 
bules in the mitotic cycle. As a result of such stabilization, the cell stops division and 
usually undergoes programmed cell death (the apoptotic cascade). Although both 
colchicines and vinca alkaloids were known to interact with microtubules, it became 
obvious (37,38) that paclitaxel had an entirely different mechanism of action, effec- 
tively "freezing" the tubulin polymerization/depolymerization cycle. Paclitaxel entered 
clinical use in 1993 and docetaxel was approved in 1995, initially for treatment of 
refractory ovarian cancer and then breast cancer, though now the taxanes are being 
used in a large number of cancer treatments. 

It is rare for a single chemistry group to first identify a clinical agent with an 
undescribed MOA, and then discover another totally different chemotype with a previ- 
ously unknown MOA. However, Wall and Wani, the discoverers of paclitaxel, were 
also responsible for the identification, isolation, and purification of camptothecin 
(Structure 26). This compound, which they isolated from the Chinese ornamental tree 
Camptotheca acuminata (39), was entered into clinical trials as its sodium salt by the 
NCI in the late 1970s, but it turned out to be too toxic due to reformation of the lactone 
ring in the bladder. Subsequent work under a National Cooperative Drug Discovery 
Program (NCDDG) involving the NCI, Johns Hopkins University, and SmithKline 
Beckman (now Glaxo SmithKline) led to the synthesis of the semisynthetic derivative 
topotecan (Structure 27), which was approved in the United States in 1996. Another 
semi-synthetic derivative, irinotecan (Structure 28), was approved roughly 18 mos ear- 
lier in Japan. Until the identification of the indolocarbazole derivatives referred to ear- 
lier (Subheading 2.2.2.), the camptothecins were the only chemotype reported to 
consistently inhibit topoisomerase I activity. 

Elliptinium acetate (Structure 29), a derivative of ellipticine isolated from several 
genera of the Apocynaceae family, was approved in the early 1980s in France as a 
treatment for breast cancer. Though there is still some usage, it is probable that various 
taxanes are utilized in preference to this agent. 

3.2. Plant Agents in Clinical Trials 

A significant number of plant-sourced agents are still in clinical trials at one level or 
another. Some are being investigated as direct cytotoxins, whereas others are being 
studied from the aspect of their potential role as inhibitors of particular cell-cycle 
enzymes, proteins, or pathways. 

3.2.1. Homoharringtonine 

Homoharringtonine (Structure 30), from the Chinese tree Cepahlotaxus harringtonia 
var drupacea (Sieb and Zucc), is still in clinical trials against various leukemias in the 
West, but is reported as being used in China as an anticancer agent (40). 

3.2.2. Flavonoids 

An early report of a natural product compound class that ultimately led to Cdk inhibi- 
tors was that of Ranelleti et al. (41) on the antitumor effect of quercetin (Structure 31). 
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This compound from the class known as flavonoids can be thought of as an ATP mimic, 
where the flat bicyclic chromone ring is an isostere of adenine. Quercetin showed a 
dose-dependent and reversible inhibition of tumor cell proliferation over the range from 
10 nM to 10 uM. Subsequent cell-cycle analysis demonstrated a block at the GqIG x 
interphase, consistent with Cdk inhibition, though it should be noted that this com- 
pound has been reported to inhibit topoisomerase II; however, a close analog, myricetin 
(Structure 32), has shown an IC 50 close to 10 uM vs Cdk2 (42). 

Flavopiridol (Structure 33) was made by the Indian subsidiary of Hoechst (now 
Aventis) following the isolation and synthesis of the plant-derived natural product 
rohitukine (Structure 34). It was the most active of approx 100 analogs when assayed 
against Cdks and showed about 100-fold more selectivity compared to its activity vs 
tyrosine kinases. It showed roughly comparable activity in the 100-400 n/W range for 
IC 50 values, depending upon the specific Cdk. It was the first compound at NCI identi- 
fied as a potential antitumor agent that subsequently was proven to be a relatively 
specific Cdk inhibitor (43). The initial report (44) on its Cdk 2 inhibitory activity was 
made in 1994, followed by data demonstrating antitumor activity by Czech et al. (45) 
in 1995. Flavopiridol is currently in phase III clinical trials as an inhibitor of cyclin- 
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dependent kinase 2 (Cdk2), both as a single agent and as a modulator in combination 
with other agents, particularly paclitaxel and cis-platinum. It has been reported to lead 
to partial and/or complete remissions in a number of phase I patients, leading to phase 
II studies in patients with paclitaxel-resistant tumors (46). There have been a number 
of relatively recent reports of this agent and the various combinations with other drugs 
and drug candidates, a significant number of which are either natural products or derived 
from natural products (22,47,48). 

3.2.3. Combretastatins 

The combretastatins, derived from Combretum caffrum, are a family of stilbenes 
which act as anti-angiogenic agents, causing vascular shutdown in tumors and result- 
ing in tumor necrosis (49). A water-soluble analog, combretastatin A-4 phosphate 
(Structure 35), has shown promise in early clinical trials and is currently in phase II. 
These relatively simple compounds, which interact with tubulin (though they do not 
stabilize polymerized tubulin), have been the progenitors of a very large number of 
synthetic molecules that are currently in a variety of testing stages ranging from pre- 
clinical to phase I clinical trials. The evolution of a significant number of derivatives is 
documented in reviews by Li and Sham (50) and Cragg and Newman (51). 

3.3. Comments on Current Use of Plant Sources 

Approximately 40% of the 150 compounds listed in the recent review (51) of tubu- 
lin-interactive and topoisomerase I inhibitors currently in preclinical and clinical trials, 
are based upon steroids, taxanes, combretastatins, or camptothecins. 

4. Marine-Derived Agents 
4. 1. Introduction 

With the advent of marine natural-product research approximately 50 yr ago, it has 
become rather obvious that the marine environment produces chemical compounds 
that have significant activities against mammalian cells. Perhaps the first notable dis- 
covery of biologically active compounds from marine sources was reported from 1950 
to 1956 by Bergmann et al. (52-54) on two compounds, spongouridine and spong- 
othymidine (Structures 36, 37), which they had isolated from the Caribbean sponge, 
Cryptotheca crypta. What was significant about these materials was that they demon- 
strated for the first time that naturally occurring nucleosides could be found containing 
sugars other than ribose or deoxyribose. 

These two compounds can be thought of as the prototypes of all of the modified 
nucleoside analogs made by chemists that have crossed the antiviral and antitumor stages 
since then. Once it was realized that biological systems would recognize the base and 
not pay too much attention to the sugar moiety, chemists began to substitute the "regular 
pentoses" with acyclic entities, and with cyclic sugars bearing unusual substituents. 
These experiments led to a vast number of derivatives that were tested extensively as 
antiviral and antitumor agents over the next 30-plus years. Suckling (55) showed how 
such structures evolved in the (then) Wellcome laboratories, leading to azidothymidine 
(AZT) and, incidentally, to Nobel prizes for Hitchens and Elion, though no direct men- 
tion was made of the original arabinose-containing leads from natural sources. 

Showing that "Nature may follow chemists rather than the reverse, or conversely 
that it was always there but the natural products chemists were slow off the mark," 
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arabinosyladenine (Ara-A or Vidarabine®) was synthesized in 1960 as a potential anti- 
tumor agent (56), was later produced by fermentation (57) of Streptomyces griseus, 
and was isolated together with spongouridine (58) from a Mediterranean gorgonian 
(Eunicella cavolini) in 1984. 

Of the many compounds derived from these early discoveries, some, such as Ara-A, 
Ara-C, Acyclovir, and later AZT and DDI, have gone into clinical use, but most have 
simply become entries in chemical catalogs. Though this class of compounds was 
known to possess antiviral and antitumor activities, synthetic analog studies led to the 
development of cytosine arabinoside (Ara-C) (Structure 38) as a clinically useful anti- 
cancer agent approx 15 yr later. 

Currently there are no approved agents that have come directly from the marine 
environment, but a very significant number of marine-sourced compounds have either 
been in clinical trials or are currently in clinical or preclinical trials as anticancer agents. 

Table 3 shows the current status of compounds from all marine sources that were or 
are now in clinical trials as antitumor agents, and some that are in preclinical testing. 
That the marine environment should produce a very large number of exquisitely potent 
agents is not surprising when it is realized that the majority of marine invertebrates 
have no methods of physical defense against predators. Aside from mollusks, which 
tend to have a hard carapace, the other phyla lack any form of outward defense, and 
thus rely upon "chemical warfare." A probable reason for the presence of extremely 
active chemotypes is simply that the dilution effect of seawater means that any toxin 
must be very potent, as it usually has to cross a finite distance to deter any predator. 

4.2. Didemnin B, the First Agent to Enter Clinical Trials 

Didemnin B (Structure 39) was isolated by Rinehart's group from extracts made of 
the tunicate Trididemnum solidum (59) that demonstrated potent antiviral and cyto- 
toxic activity against P388 and L1210 murine leukemia cell lines. Didemnin B was 
subsequently advanced into preclinical and clinical trials (phases I and II) under the 
auspices of the NCI in the very early 1980s as the first defined chemical compound 
directly from a marine source to go into clinical trials for any major human disease. 
The compound turned out to be too toxic for use, and trials were officially terminated 
in the mid-1990s by NCI. However, the experience gained from these efforts was 
immensely helpful in aiding the trials of other compounds. 

Rinehart's group developed methods of large-scale isolation and purification, and 
total syntheses that permitted significant structure-activity relationships to be derived 
(60). This enabled materials to be provided to other investigators so that basic bio- 
chemical studies could be performed, leading to the identification of a potential mecha- 
nism of action for this compound — namely, the binding to elongation factor 1-oc 
reported by Crews et al. (61). Subsequent reports showed that didemnin B binds 
noncompetitively to palmitoyl protein thioesterase (62), and the following year, it was 
reported (63) that rapamycin inhibited the didemnin-induced apoptosis of human HL- 
60 cells, perhaps by binding to the FK-506 binding protein(s). 

In 2002, Vera and Joullie (64) published an excellent review of didemnins as cell 
probes and targets for syntheses, and also made some reasonable arguments that the 
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dosing schedules used in the early clinical trials may well have been nonoptimal for 
demonstrating activity as a cytotoxin, rather than as an immunosuppressive/modulator. 
Although didemnin B was not successful, a very close chemical relative is currently 
in clinical trials (see aplidine, Subheading 4.4.3.)- Recently, Rinehart published an 
overview of these compounds as part of a discussion of antitumor compounds from 
tunicates (65). 

4.3 Post-Didemnin B Compounds That Entered Clinical Trials 

A significant number of compounds from marine sources that have been entered 
into preclinical and clinical trials since the early 1980s, emanated from programs 
funded by the US National Cancer Institute. The NCI has supported many of the 
searches for agents active against cancer, via a variety of mechanisms, ranging from 
random to targeted shallow-water collections, by both direct contractors and via grants 
to academics, both individually and as parts of consortia. Thus, NCI has the systems in 
place for collection, bioactivity determinations, and subsequent testing in animals and 
humans, with the aim of finding new and potent treatments for cancers. In addition, the 
National Science Foundation, via grants to academics for basic marine research, has 
significantly contributed to the discovery process, as the agents that were identified 
were then tested via the NCI in a significant number of cases. 

Because of the extremely long time frame involved in such processes (for example, 
paclitaxel, took over 20 yr from structural determination and reporting until US Food 
and Drug Administration [FDA] approval), the compounds that will be discussed fall 
into two approximate time frames: those from the initial collection programs (which 
aided the didemnin B discovery), and those that are further back in the system, which 
have been discovered as a result of the modified NCI screens utilizing the 60-cell line 
(or functional equivalent) screen from the early 1990s. 

4.3.1. Dolastatins 

The dolastatins are a series of cytotoxic peptides that were originally isolated in very 
low yield by Pettit's group (66-70) as part of its work on marine invertebrates from the 
Indian Ocean mollusk, Dolabella auricularia. Owing to the potency and mechanism of 
action of dolastatin 10 (Structure 40), a linear depsipeptide that was shown to be a 
tubulin-interactive agent binding close to the vinca domain in a site where other pep- 
tidic agents bound (71,72), the compound entered phase I clinical trials in the 1990s 
under the auspices of the NCI. Because the natural abundance was so low, Pettit and 
others developed synthetic methods that provided enough material under cGMP condi- 
tions to commence trials (66). Dolastatin 10 progressed through to phase II trials as a 
single agent, but it did not demonstrate significant antitumor activity in a phase II trial 
against prostate cancer in humans (73). Similarly, no significant activity was seen in a 
phase II trial against metastatic melanoma (74). 

As a result of the synthetic processes alluded to earlier, many derivatives of the 
dolastatins have been synthesized, with TZT-1027 (Auristatin PE or Soblidotin) (Struc- 
ture 41) now in phase II clinical trials in Europe, Japan, and the United States under the 
auspices of either Teikoku Hormone, the originator, or the licencee, Daiichi Pharma- 
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Table 3 

Status of Selected Marine-Derived Compounds in Clinical and Preclinical Trials (12/03) 



Name 



Source 



Status (Disease) 



Comment 



Didemnin B 
Bryostatin 1 



Dolastatin 10 



Cematodin 

TZT-1027 

Girolline 
Ecteinascidin 743 



Aplidine 



E7389 



Discodermolide 



Trididenmum solidum 
Bugula neritina 



Dolabella auricularia 
(Marine microbe derived; 
cyanophyte) 



Synthetic derivative 
of Dolastatin 15 (Marine 
microbe derived; 
cyanophyte) 

Synthetic Dolastatin 



Pseudaxinyssa cantharella 
Ecteinascidia turbinata 



Aplidium albicans 



Lissodendoryx sp. 



Discodermia dissoluta 



Phase II (Cancer) 
Phase II (Cancer) 



Phase II (Cancer) 



Phase II (Cancer) 



Phase II (Cancer) 



Phase I (Cancer) 

Phase III (Cancer) 
in 2003. EC CPMP 
did not approve rapid 
marketing for sarcoma 
(07-11/2003) but 
orphan drug designation 
by EC for ovarian 
cancer (10/03) 

Phase II (Cancer) 
EC COMP/EMEA 
approved orphan drug 
status for ALL (07/2003) 

Phase I (Cancer) 



Phase I (Cancer) 



Dropped mid-1990s 

Now in combination 
therapy trials; 
licenced to GPC 
Biotech in Germany 
by Arizona State 
Univ. May be pro- 
duced by bacterial 
symbiont 

Many derivatives 
made synthetically; 
now known to be a 
cyanobacterial 
product. No positive 
effects in phase II 
trials so far reported 

Some positive effects 
on melanoma pts 
in phase II. Listed 
as discontinued. 

Also known as 
Auristatin PE and 
Soblidotin. 

Discontinued due to 
hypertension 

Licenced to Ortho 
Biotech (J&J). 
Partial synthesis 
from a microbial 
product for 
production 



Dehydrodidemnin B 



Eisai's synthetic 
halichondrin B 
derivative. 

Licenced to Novartis 
by Harbor Branch 
Oceanographic 
Institution 



(continued) 



Anticancer Drugs From Natural Sources 
Table 3 (Continued) 



145 



Name 



Source 



Status (Disease) 



Comment 



Kahalalide F 



Elysia rufescens/ Bryopsis sp. Phase II (Cancer) 

early 2003 



ES-285 (Spisulosine) Spisula polynyma 

HTI-286 

(Hemiasterlin derivative) Cymbastella sp. 



Bengamide derivative Jaspis sp. 



Phase I (Cancer) 
Phase I (Cancer) 

Phase I (Cancer) 



Cryptophycins Nostoc & Dysidea arenaria Phase I (Cancer) 

(also Arenastatin A) 



KRN-7000 

Squalamine 
AE-941 (Neovastat) 



Laulimalide 

Curacin A 

Vitilevuamide 

Diazonamide 

Eleutherobin 



Dictyostatin-1 
Sarcodictyin 
Peloruside A 
Salicylihalimides A & B 



Agelas mauritianus 

Squalus acanthias 
Shark 



Phase I (Cancer) 

Phase II (Cancer) 
Phase III (Cancer) 



Cacospongia mycofijiensis Preclinical (Cancer) 



Lyngbya majuscula 



Preclinical (Cancer) 



Didemnum cucliferum Preclinical (Cancer) 

& Polysyncraton lithostrotum 
Diazona angulata Preclinical (Cancer) 



Eleutherobia sp. 



Spongia 

Sarcodictyon roseum 
Mycale hentscheli 
Haliclona sp. 



Preclinical (Cancer) 



Preclinical (Cancer) 
Preclinical (Cancer) (derivatives) 
Preclinical (Cancer) 
Preclinical (Cancer) 



Licenced to 
PharmaMar by 
Univ. Hawaii. Syn- 
thesized by Univ. 
Barcelona chemists. 

Rho-GTP inhibitor 

Derivative made by 
Univ. British 
Columbia; licensed to 
Wyeth. 

Bengamide A licenced 
to Novartis; deriva - 
tive made that was a 
Met- API inhibitor, 
withdrawn 2002 

From a terrestrial 
cyanophyte, but also 
from a sponge as 
Arenastatin A. Syn- 
thetic derivative 
licenced to Lilly by 
Univ. Hawaii, but 
withdrawn 2002 

An Agelasphin 
derivative 

Squalamine 

"Defined mixture of 
<500 kDa from 
cartilage" 

Synthesized by many 
groups in both partial 
and complete syn- 
thetic schemes 

Synthesized, more 
soluble combi-chem 
derivatives being 
evaluated 



Synthesized and new 
structure elucidated 

Synthesized and deriva- 
tives made by combi- 
chem. Can be made by 
aquae ulture 

Combi-chem synthesis 

First marine Vo-ATPase 
inhibitors 
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ceuticals. Another derivative of dolastatin 15, known as Cematodin (Structure 42) or 
LU- 103793, was entered into phase I clinical trials by the Knoll division of Abbott 
GMBH for treatment of breast cancer. It progressed into phase II, but trials appear to 
have been discontinued. By using tritium-labeled dolastatin 15 as a bioprobe, Hamel's 
group at NCI (75) recently reported that the vinca domain in tubulin may well be com- 
posed of a series of overlapping domains rather than being a single entity, as different 
levels/types of competition were found when selected tubulin-interactive agents were 
used to investigate the binding characteristics of the labeled probe. 

Similarly to bryostatin (see Subheading 4.4.1.). there was always a potential ques- 
tion with the dolastatins as to whether or not they were microbial in origin, as peptides 
with unusual amino acids have been well documented in the literature as coming from 
the Cyanophyta. In the last few years, this supposition has been shown to be fact. Thus, 
in 1998, workers at the Universities of Guam and Hawaii reported the isolation and 
purification of simplostatin 1 (Structure 43) from the marine cyanobacterium Simploca 
hynoides (76). This molecule differed from dolastatin 10 by the addition of a methyl 
group on the first ,/V-dimethylated amino acid. Subsequently, in 2001 , the same groups 
reported the direct isolation of dolastatin 10 from another marine cyanobacterium that 
was known to be grazed on by D. auricularia (77). Dolastatin 10 was in fact isolated 
from the nudibranch following feeding of the cyanophyte, thus confirming the original 
hypothesis (personal communication, Dr. V. J. Paul). Recently, the mechanism of action 
of symplostatin 1 was shown to be similar to dolastatin 10 but to be somewhat more toxic 
to mice at comparable doses (78). 

4.3.2. Girolline (Girodazole) 

This very simple compound, a substituted imidazole (Structure 44), was reported 
from the sponge Pseudaxinyssa cantharella (79), and was shown by workers at Rhone- 
Poulenc Rorer (now Aventis) to be an inhibitor of protein synthesis, acting preferen- 
tially on the termination step in eukaryotic protein synthesis. This is in contrast to other 
known protein synthesis inhibitors, such as emetine, homoharringtonine, anguidine, 
and bruceantin, which generally act at either the initiation or elongation steps (79). 
Girolline proceeded through to phase I clinical trials in humans, but the trials were 
stopped due to significant hypertensive effects seen in treated patients. 

4.3.3. Bengamide Derivatives 

Bengamides A and B (Structures 45, 46) were first reported in 1986 by Crews's 
group at the University of California, Santa Cruz (UCSC), as antihelminthic com- 
pounds, with some antibiotic and cytotoxic activity (80). The number of bengamide 
analogs isolable from the same sponge was extended to bengamide G (81,82). In a 
subsequent paper with workers from Novartis, the number of compounds in the group 
was extended, and their antitumor activities reported (83). 

The bengamides were evaluated by Novartis (then Ciba-Geigy). As a result of their 
intrinsic activities, a synthetic program was put in place that developed a derivative of 
bengamide A (Structure 47) as a clinical candidate. This compound was shown to be an 
inhibitor of methionine aminopeptidases and subsequently entered phase I clinical tri- 
als in 2000, but was withdrawn in the middle of 2002. 
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4.3.4. Cryptophycins 

These compounds were reported from two blue-green algae, initially by a group 
from Merck in 1990 using Nostoc sp. strain MB5357 (ATCC 53789), which was origi- 
nally isolated from a lichen on a Scottish island; they reported only the antifungal 
activity, finally deciding not to proceed with development as it was too toxic. Moore's 
group at the University of Hawaii subsequently identified the same compound (84) 
from a nonmarine cyanophyte, Nostoc sp., strain GSV-224, and almost contemporane- 
ously, a similar molecule was reported by Kobayashi et al. from an Okinawan sponge 
(see next paragraph). The Universities of Hawaii and Wayne State then licensed the 
natural cryptophycins and synthetic derivatives to the Eli Lilly Company for advanced 
preclinical and clinical development. This led to the selection of a semi-synthetic 
derivative, cryptophycin 52 (LY355703) (Structure 48), as a phase I clinical candi- 
date in the middle 1990s (85). The routes, both chemical and pharmacological, leading 
to the choice of this particular derivative were described by Shih and Teicher (86) of 
the Lilly Research Laboratories. The compound progressed towards phase II trials, but 
in 2002, cryptophycin 52 was withdrawn from trials. 

Although the original cryptophycins came from terrestrial cyanophytes, and the clini- 
cal candidate came from semisynthetic modifications of the natural product, in 1994 
Kobayashi et al. reported that an acetone extract of the Okinawan sponge, Dysidea 
arenaria, had potent cyctoxicity (87) and yielded a compound (arenastatin A) that sub- 
sequently turned out to be identical to cryptophycin 24 (Structure 49), reported by 
Moore's group in 1995 (88). A later report from the Japanese group demonstrated that 
arenastatin A and synthetic analogs also are tubulin-interactive agents similar in activ- 
ity to the other cryptophycins reported by Moore et al. (89,90). 

4.4. Compounds Currently (December 2003) in Clinical Trials 

4.4.1. Bryostatin 

In 1968, NCI commissioned a large-scale (for those days) collection of the bryo- 
zoan, Bugula neritina, by Jack Rudloe of the Gulf Specimen Company off the west 
coast of Florida, and this collection was sent to Pettit's group for workup. The extract 
was found to be inactive against the L1210 murine leukemia cell line but to give a 68% 
increase in life span using the same concentration against the P388 murine leukemia 
cell line (91). Following very significant amounts of work by Pettit and his group, 
including more collections on a larger scale, significant problems with isolation as a 
result of dealing with vanishingly small quantities of a very potent agent, and problems 
related to assay reproducibility, the compound was purified and identified as bryostatin 
3 (Structure 50), one of a series of closely related compounds that now number 20. 

Subsequent work by Pettit's group identified two other geographic areas where sig- 
nificant (in relative terms) quantities of bryostatin 1 (Structure 51) could be isolated 
from B. neritina colonies. As a result of prodigious efforts on the part of the Pettit 
group and workers at NCI-Frederick, by 1990 there was enough cGMP-grade material 
to commence systematic clinical trials. From these studies (92), it was determined that 
bryostatins bind to the same receptors as the phorbol esters, which are tumor promot- 
ers, but have no tumor promoter activity. As a result of this binding, however, the 
protein kinase C isoforms in various tumor cells are significantly down-regulated, lead- 
ing to inhibition of growth, alteration of differentiation, and/or death. 
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To date, bryostatin 1 has been in more than 80 human clinical trials, with more than 
20 being completed at both the phase I and phase II levels. There have been some 
responses to the compound as a single agent, with effects ranging from complete remis- 
sion (CR) and partial remission (PR) to stable disease (SD). However, the use as a single 
agent is probably not the optimal usage for this agent (92,93). When combined with 
cytotoxic drugs such as the vinca alkaloids, paclitaxel, fludarabine, cisplatin, and so 
on, responses in phase I trials have been reported (92), and currently three phase I and 
five phase II trials are ongoing in a number of centers, using a variety of protocols. 
Details can be obtained from http://clinicaltrials.gov for these and for other agents in 
clinical trials. The major side effect appears to be myalgia, but this is treatable by 
standard supportive therapies. 

One very interesting question arising from the search for bryostatin sources was 
why is the nominal producing organism so ubiquitous, but the number of B. neritina 
colonies that actually produce detectable bryostatin 1-3 levels so low and geographi- 
cally spread? One possible answer comes from the work of Haygood and her collabo- 
rators at the Scripps Institution of Oceanography. Haygood showed that the bryozoan 
is actually the host to a symbiotic organism that may well be the actual producer of the 
compound; in an elegant series of experiments, she and her colleagues demonstrated 
the presence of a putative PKS-I gene fragment in colonies that produced bryostatin, 
that was absent in non-producers (94). In addition, Davidson and Haygood demon- 
strated that there are subdivisions within B. neritina samples taken from the same sites 
but at different depths. Thus, at depths greater than 9 m, (the D type), bryostatins 1-3 
and minor components are found, whereas at less than 9 m (S type), only the minor 
derivatives are seen. The symbiotic organisms isolated from each type differ in certain 
aspects of their respective 16S ribosomal sequences, giving rise to the possibility that 
the bryozoans are also different taxonomically (95). Reports at the Society for Indus- 
trial Microbiology meetings in 2002 and 2003 demonstrated that Haygood and col- 
laborators were pursuing the possibility of transferring this particular PKS fragment to 
other, more amenable microbes in order to further investigate the possibility of produc- 
ing bryostatin by fermentative means. At the September 2003 International Marine 
Biotechnology Conference in Chiba, Japan, Haygood (96) reported on the current sta- 
tus of the PKS search, suggesting that this system resembled that reported by Piel (97) 
for the Paederus beetle's pseudomonal symbiont PKS, which produces pederine, in 
that there are no AT domains in the clusters. Further work is ongoing, utilizing "remote" 
AT domains from another organism. 

4.4.2. Ecteinascidin 743 

Antitumor activity of extracts from the ascidian, Ecteinascidia turbinata, had been 
reported as early as 1969 by Sigel et al. (98), but it was not until 1990 that the structures 
of the active components were published simultaneously by Rinehart et al. (99) and 
Wright et al. (100). 

The structure of the most stable member of the series, known as Et743 from the 
absorption maximum, is shown below (Structure 52). The basic structure, without the 
exocyclic isoquinoline group, is a well-known base structure (101) of compounds from 
both microbes (saframycins, naphthyridinomycins, safracins, quinocarcins) and 
sponge-derived metabolites (renieramycins, with the latest variation, renieramycin J 
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being reported by Oku et al. [102]), but the exocyclic substituent was novel, as was the 
bridging sulfur. The yield from natural sources was very low, and in order to provide 
enough material to perform basic studies as to the mechanism of action, and in vitro 
and in vivo animal studies, very significant amounts of the ascidian had to be collected 
from areas around the Caribbean. The compound was synthesized by Corey (103) and, 
as a result, he made a phthalimide-substituted version known as phthalascidin, which 
demonstrated reasonable activity in the regular test systems (104). 

Et743 was licensed by the University of Illinois to the Spanish company PharmaMar 
for subsequent development. PharmaMar chemists then performed an elegant semi- 
synthesis from the marine Pseudomonas fluorescens metabolite safracin B, which pro- 
vided cGMP-grade Et743 from a 21 -step synthetic process on a scale large enough to 
provide enough material for clinical trials (105). This was made possible in spite of a 
low overall yield of 1.4% because the starting material could be obtained on a large 
scale by fermentation. The mechanisms of action of Et743 have been shown to include 
the following at physiologically relevant concentrations: effects on the transcrip- 
tion-coupled nucleotide excision repair process (TC-NER) (106,107); and interac- 
tion between the Et743 DNA adduct and DNA transcription factors, in particular 
the NF-Y factor (108). 

The compound was placed into human clinical trials while these mechanisms were 
being worked out, and so far it has been administered to over a 1000 patients in phase 
I and phase II trials covering a variety of cancers. Results from the European phase I 
and pharmacokinetic trials were recently reported by Twelves et al. (109). As a result 
of these trials, Et743 was preregistered in the EU and granted orphan drug status for 
sarcoma by the European Commission's Committee for Orphan Medicinal Products 
(COMP) of the European Agency for the Evaluation of Medicinal Products (EMEA). It 
was licenced to Johnson and Johnson (Ortho Biotech) in 2001 under the brand name 
Yondelis®, with the generic name of trabectedin. It is currently in a variety of phase II 
trials in the United States for the treatment of sarcoma, and is in phase II trials in 
Europe for this and ovarian cancer. Two reports on the phase II trials were recently 
published (110,111), giving details of toxicities and response levels predominantly in 
sarcoma patients. Recently, the EU's Committee for Proprietary Medicinal Products 
(CPMP) recommended that marketing authorization for advanced soft-tissue sarcoma 
not be granted for the EU under a rapid orphan drug classification, and this decision 
was upheld on appeal in November 2003. However, the same agency granted orphan 
drug status for ovarian cancer in October 2003, so trials are still ongoing. 

4.4.3. Aplidine 

This compound, formally dehydrodidemnin B (Structure 53), was first reported in a 
patent application (112) in 1991 and then referred to in a paper from Rinehart's group 
in 1996 on the structure-activity relationships amongst the didemnins (60). The antitu- 
mor potential was first reported by PharmaMar scientists (113,1 14) in 1996, and the 
total synthesis was reported in a patent application (115) in 2000. 

The compound, generic name "aplidine or dehydrodidemnin B" and a trade name of 
Aplidin®, was entered into phase I clinical trials in 1999 under the auspices of 
PharmaMar in Canada, Spain, France, and the UK for treatment of both solid tumors and 
non-Hodgkin's lymphoma. These were successfully completed with over 200 patients, 
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and demonstrated that a dosage of up to 5 mg.M 2 was well tolerated in either a 3-h or 
24-h infusion every other week (116). The dose-limiting toxicity was muscle pain that 
was responsive to either dose limitation or addition of carnitine. Interestingly, in the 
presence of carnitine, the maximum tolerated dose could be increased by 40% to 7 mg.M 2 . 
Phase II clinical trials are now underway in Europe comparing the two dosage regimens 
in renal and colon carcinomas, together with an outpatient regimen, and, in July 2003, 
the European Commission's COMP/EMEA awarded orphan drug status for acute lym- 
phoblastic leukemia (ALL) to Aplidin®. 

The precise mechanism of action of this agent is not yet known, but it appears to 
block vascular endothelial growth factor (VEGF) secretion and block the correspond- 
ing VEGF-VEGF-Receptor-1 (also known a.sflt-1) autocrine loop in leukemic cells 
(117). In addition, effects on jnk, src, and p38-MAPK, possibly mediated via glu- 
tathione depletion, were recently reported (118), with the end result being induction of 
the apoptotic cascade in MDA-MB-231 cells at aplidine levels of 5 nM, which are 
below the blood levels achievable in humans. In these experiments, general caspase 
inhibitors decreased apoptotic efficacy by approx 50%, thus implicating at least two 
different mechanisms of apoptosis, one via caspases, the other not involving caspase 
activation. 

What is very interesting is that the removal of two hydrogen atoms (i.e., conversion 
of the lactyl side chain to a pyruvyl side chain) appears to significantly alter the toxic- 
ity profile, as this is the only formal change in the molecule when compared to didemnin 
B, though the comments on dosage regimens under didemnin B (Subheading 4.2.) 
from Vera and Joullie should be taken into account when such comparisons are made 
in the future. 

4.4.4. Halichondrin B (and Derivatives) 

Halichondrin B (Structure 54) is one of a series of compounds originally isolated by 
Uemura et al. in 1985 from the Japanese sponge Halichondria okadai (119,120). Sub- 
sequently, a number of sponges from other areas of the Pacific and Indian oceans were 
reported to contain one or more of these macrolides, including Axinella sp. from the 
Western Pacific (121), Phakellia carteri from the Eastern Indian Ocean (122), and 
Lissodendoryx sp. off the East Coast of South Island, New Zealand (123). 

In 1992, Kishi's group at Harvard, funded by the NCI, reported that they had suc- 
cessfully synthesized both halichondrin B and norhalichondrin B (124). Working with 
the US division of the Japanese pharmaceutical company Eisai, Kishi's synthetic 
schemes were utilized to synthesize a large number of variants of halichondrin B, par- 
ticularly smaller molecules that maintained biological activity but were intrinsically 
more chemically stable, due to the substitution of a ketone for the ester linkage in the 
macrolide ring. Two of these agents were subsequently evaluated by NCI in conjunc- 
tion with the Eisai Research Institute in the United States, and one, originally ER- 
086526 (NSC 707389) and now renumbered E7389 (Structure 55), is now in phase I 
clinical trials in conjunction with the NCI. 

Details of the biology and chemistry of this compound and other compounds in the 
series have recently been published by Eisai scientists, thus demonstrating the power 
of current synthetic chemistry when applied to a very potent marine-derived natural 
product (125,126). Using the synthetic techniques described, enough cGMP material, 
produced by total synthesis, was provided to the NCI for the projected clinical trials. 
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4.4.5. Discodermolide 

This polyhydroxylated lactone (Structure 56) was reported by the Harbor Branch 
Oceanographic Institution (HBOI) group in 1990 following isolation from the Carib- 
bean sponge Discodermia dissoluta, originally collected at a depth of 133 m off the 
Bahamas (127), with a revision to the stereochemistry being published the following 
year (128). Originally, the compound was judged to be a new immunosuppressive agent 
and an incidental cytotoxin, but in 1996, it was reported that discodermolide bound to 
microtubules more potently than Taxol®, a discovery that confirmed in silico studies at 
the University of Pittsburgh (129). The reports from HBOI led to synthesis of the (-) 
isomer by Schreiber's group (130). Subsequently, other groups made the same isomer, 
and then in the late 1990s to 2003, Marshall and Johns (131), Halstead (132), Smith et 
al. (133), and Paterson et al. (1 34) all reported syntheses that produced varied isomers 
in good yield. Kilogram amounts are now achievable by total synthesis (135). In the 
interim, HBOI licensed the compound to Novartis as a preclinical candidate, and it is 
now in phase I clinical trials as a potential treatment against solid tumors. 

The HBOI group are still discovering more derivatives of the natural product, and 
the same group isolated a marine macrolide, dictyostatin 1 (Structure 57), having some 
of the structural features of discodermolides, from a deep-water lithistid sponge, by 
following a tubulin interaction assay rather than relying on cytoxicity (136). This 
material had previously been reported, but as a cytotoxin, from a Maldavian Spongia 
species by Pettit et al., and in a relatively recent paper, Horwitz's group demonstrated 
how discodermolide and Taxol may well fit into the same site on tubulin (137). 

4.4.6. Kahalalide F 

This cyclic depsipeptide (Structure 58) was isolated from the Sacoglossan mollusk 
Elysia rufescens, following grazing by the mollusk on a green macroalga, Bryopsis sp. 
After isolation and identification, it was discovered that the depsipeptide also occurs in 
the alga, but on a wet-weight basis, the mollusk concentrates the depsipeptides signifi- 
cantly (138, 139). 

The compound was licensed to PharmaMar by the University of Hawaii in the 1990s, 
and it entered preclinical testing. Its full MOA has not yet been determined, but it was 
known to target lysosomes (140), thus suggesting selectivity for tumor cells with high 
lysosomal activity, such as prostate tumors. The compound was synthesized in a very 
efficient manner using solid-phase peptide techniques (141), and entered phase I clini- 
cal trials in Europe in December 2000 for the treatment of androgen-independent pros- 
tate cancer. The compound has now entered phase II trials, predominantly as a treatment 
for prostate cancer. Recently, Suarez et al. (142) demonstrated that kahalalide F induces 
cell death via "oncosis" (progression of cellular processes leading to necrotic cell death), 
possibly initiated by lysosomal membrane depolarization. 

4.4.7. Spisulosine 

In 1999, workers from PharmaMar reported on the initial studies with a molecule 
known as ES-285 or spisulosine (Structure 59), which was isolated from the marine 
clam Spisula polynyma. They demonstrated that this compound causes a loss of actin 
stress fibers, which may well be a result of its resemblance to lysophosphatidic acid 
(LPA) and hence an interaction with the LPA receptor, which is known to modulate the 
levels of the Rho proteins (143). This molecule is currently in phase I trials in Europe 
under the aegis of PharmaMar. 
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4.4.8. HTI-286 (Hemiasterlin Derivative) 

Hemiasterlin (Structure 60) was originally reported by Kashman's group from the 
South African sponge Hemiasterella minor, an organism that also contained jaspamide 
and geodiamolide TA (144). This was quickly followed by the report of a group of 
peptides that exhibited significant cytotoxic activities from a Papua New Guinea sponge 
that was originally classified as Pseudoaxinyssa sp. but, due to a taxonomic revision, 
the genus was changed to Cymbastela. This particular sponge produced a number of 
peptides, including geodiamolides A to F, hemiasterlin as described by Kashman, and 
two novel hemiasterlins, A and B, plus other geodiamolides and criamides (145). In 
1997, following testing of the hemiasterlin and the A and B derivatives in experiments 
to determine their mechanism of action, it was discovered that these agents interact 
with tubulin to produce microtubule depolymerization in a manner similar to that 
reported for nocodazole and vinblastine (146). Further investigations by Hamel's 
group using hemiasterlin isolated at NCI (147) indicated that this peptide, together 
with cryptophycin 1 and dolastatin 10, inhibits tubulin assembly and probably is bound 
at what is being called the "peptide binding site" (148). 

In the intervening time, Andersen at the University of British Columbia (UBC) com- 
menced a synthetic program in order to produce the original hemiasterlin using a syn- 
thetic scheme that would permit variations on the overall structure in order to determine 
SAR requirements (149). The hemiasterlins, including the analogs made by Andersen's 
group (which included HTI-286 (Structure 61) (150), were licensed by the UBC to 
Wyeth for development as part of an NCDDG program, of which Andersen was a 
member. Significant amounts of synthetic work were performed around these struc- 
tures, but the original agent was still superior and is now in phase I clinical trials and is 
scheduled for phase II (151). 

4.4.9. KRN-7000 

In 1993, workers at the Kirin Brewery, in conjunction with Higa at the University of 
the Ryukyus, reported the first isolation of oc-galactosylceramides from natural sources, 
the marine sponge Agelas mauritianus . Very interestingly, these compounds, the 
agelasphins (Structure 62), demonstrated antitumor and potential immunostimulatory 
activities (152,153). Following these results, and demonstration that these molecules 
were potent in vivo active agents against the murine B16 melanoma, various deriva- 
tives were made, culminating in the production of KRN-7000 (Structure 63) (154,155). 
This compound entered phase I clinical trials in both Asia and Europe in 2001 for 
cancer immunotherapy. 

4.4.10. Squalamine 

This compound, isolated from the common dogfish shark Squalus acanthias, col- 
lected off the New England coast, was originally reported by Zasloff ' s group and col- 
laborators from the University of Pennsylvania in 1993, and was shown to be a fairly 
simple aminosterol (Structure 64) with broad-spectrum antibiotic activity (156). The 
compound was licensed to Maganin Pharmaceuticals (now Genaera) for development, 
entered phase I, and now has progressed into phase II clinical trials for non-responding 
solid tumors, as part of a combination with standard agents, and as primary treatment 
for advanced ovarian cancer. It went into phase I studies for prostate cancer in 2003. 
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There are reports in the literature that it may well have antiangiogenic activity under 
some conditions (157). 

4.4.11. AE-941 or Neovastat 

This is not a true compound, but is probably best thought of as a "defined mixture" 
in that it is a standardized liquid extract comprising the <500-kDa fraction from shark 
cartilage. This material is made under cGMP conditions from taxonomically identified 
shark species harvested under sustainable conditions, and has quality controls that per- 
mitted both the FDA and the Canadian equivalent to give approval for clinical trials. 
The methods of preparation have been published in reasonable detail by Sorbera et al. 
(158), and the first formal report of angiostatic and antitumor activity was given in 
1997 at the American Association for Cancer Research meeting (159). 

The preparation is in many phase III clinical trials in Canada, Europe, and the United 
States, with details of the pivotal studies in phase III renal carcinomas being given in 
two recent reviews (160,161). The initial clinical results from these studies will be of 
great interest. 

4.5. Selected Antitumor Compounds From Marine Sources in Preclinical 
Status 

4.5.1. Tubulin-lnteractive Agents 

These compounds include materials that interact with tubulin, both in a manner simi- 
lar to that of paclitaxel and to the vinca alkaloids, with the compound laulimalide (Struc- 
ture 65) being a very good example as a microtubule-stabilizing agent. Hamel (71) 
indicated that this agent might well bind at a site different from the taxanes, though it is 
possible that it might also be binding to unpolymerized tubulin or to aberrant poly- 
meric tubulin. Over 10 synthetic routes to laulimalide have been published, plus many 
more that give methods of synthesis of "subassemblies" of the overall molecule 
(162,163). 

Other agents that intereact with tubulin include curacin A (Structure 66) from the 
cyanobacterium Lyngbya majuscula (164), vitilevuamide (Structure 67) from the ascid- 
ians Didemnum cuculiferum and Polysyncraton lithostrotum (165) (this compound should 
be compared to those given in the recent review of bioactive peptides by Janin (166)), 
and diazonamide A (Structure 68) from the ascidian Diazona angulata (167) (and its 
subsequent reassessment and synthesis of a revised structure that has biological activ- 
ity by Cruz-Monserrate et al. [168]). 

The sarcodictyins (Structure 69), from the Mediterranean corals Sarcodictyon 
roseum and Eleutherobia aurea, were reported by Pietra's group (169,170) and bear a 
close structural resemblance to eleutherobin (Structure 70), which was originally iso- 
lated by Fenical's group (171) from the Australian octacoral Eleutherobia sp. 
Eleutherobin was reisolated from the Caribbean octacoral Erythropodium caribaeorum 
by Andersen et al. (772) and then reported from whole-organism aquaculture by 
Andersen et al. (173). 

Peloruside A (Structure 71), from the New Zealand marine sponge Mycale hentscheli 
(174), is an interesting case, as the biological activity of this compound induces 
apoptosis following a G 2 -M arrest; its relatively simple structure may well lend itself to 
synthetic modifications (175). Also, as noted by Ghosh and Kim (176), it has structural 
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similarities to the epothilones. Their partial synthesis may significantly aid in the pro- 
duction of enough material to further evaluate the full potential of this compound. 

4.5.2. Vacuolar-ATPase Inhibitors 

Vo-ATPases occur throughout eukaryotes, and their prime function is to pump hydro- 
gen ions from one side of a membrane to the other. These particular enzymes perform 
this function within vacuoles in the cell and are dependent upon ATP for the necessary 
energy to perform the function. 

In 1997, Boyd's group at the NCI reported the isolation of two closely related and 
very cytotoxic novel macrolide structures, the salicylihalimides A and B (Structures 
72, 73), from the Western Australian marine sponge Haliclona sp., with GI 50 values 
below 1 nM for sensitive melanoma lines in the NCI 60 cell line screen (177). These 
agents did not resemble any profiles from the NCI's 60-cell-line dataset of standard 
agent responses, but did show similar patterns to the bafilomycin and concanamycin 
derivatives. The latter compounds are known to exhibit Vo-ATPase inhibitory activity 
but are too toxic for human use, though bafilomycins have been used for plant protec- 
tion. 

Subsequent work from the same laboratory expanded the range of the structures to 
include another marine-derived product series, the lobatamides, and both series were 
shown to be specific for the higher eukaryotic Vo-ATPases but not the fungal equiva- 
lents. Subsequently, similar molecules have been isolated from bacteria, fungi, and 
myxobacteria. Although syntheses have been published, to date, no formal in vivo 
assays of the original compounds have been performed due to a lack of access to the 
natural source from Western Australia. 

This class of compounds is of interest not only as antitumor agents but also in bone 
resorption; thus, they may well have utility in osteoporosis. Two recent reviews have 
been published on these agents (178,179). 

5. The Actual Source(s) of Marine Products? 

The continuing question of whether symbiotic (and/or commensal) microbes are the 
actual producers of cytotoxic and other metabolites is being investigated by a variety of 
groups, and as information evolves, the probability increases that a very significant 
number of polyketide-derived compounds and non-ribosomally produced peptides are 
of microbial origin. A recent review by Janin (166) shows the structures of a large 
number of such molecules. A very recent example is the one given by Pi el on the 
production of pederine by the pseudomonal symbiont from Paederus beetles referred 
to earlier (Subheading 4.4.1.)- In a presentation at the Society for Microbiology meet- 
ing in August 2003, Piel gave further information that demonstrated the formation of a 
very close structural relative of the marine sponge cytotoxins, the onnamides, by this 
terrestrially derived PKS system. 

As can be seen by inspection of the structure of Et743, it bears a close similarity to 
the antibiotics saframycin B, naphthridinomycin, saframycin, and to sponge metabo- 
lites of the renieriamycin class. There are many other agents from marine sources, such 
as the mycalamides and onnamides, that have structures or partial structures similar to 
molecules isolated from terrestrial or other marine phyla, and it was posited from such 
circumstantial evidence that a significant number of ostensibly marine-derived com- 
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pounds, particularly from the Porifera, were in fact derived from commensal and/or 
symbiotic microbes. 

However, it was not until the autumn of 2003 that there was other than circumstan- 
tial evidence of production, when Hill (University of Maryland) and Hamann (Univer- 
sity of Mississippi) reported at the 6th International Marine Biotechnology Conference 
on their work with a purified commensal streptomycete isolated from a deep-water 
Indonesian sponge. They demonstrated that the microbe, when grown in a laboratory 
setting, with some media but not others, produced manzamine A and 8-hydroxy- 
manzamine A, compounds that were isolated from the sponge itself following wild 
collection (180, 181). 

6. Conclusions 

We have shown that natural products from all sources have had and will continue to 
have major influences on the identification and production of agents against cancers of 
all types. Our experience implies that the novel chemotypes directed against cancer 
have tended to come from microbial and marine sources rather than from vascular 
plants. This may simply be due to the centuries of exploration of the plant kingdom 
compared to the other two potential sources of natural products. However, by using the 
power of directed combinatorial library syntheses based on natural products and/or 
natural-product-like molecules, the potential to produce agents that have been modi- 
fied in order to increase their utility as drug candidates has been and continues to be 
demonstrated. Early examples are the modification of the eleutherobin/sarcodictyin 
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skeletons by Nicolaou's group (182), the potential for production of novel epothilone- 
based structures, also by Nicolaou's group (183), and the synthesis of natural-product- 
like libraries by Schreiber's group (184,185). 

If one couples to these chemical techniques, the novel structures that are now being 
reported from total extraction of DNA from soils (186), and the amplification of 
polyketide/nonribosomal peptide synthetases from a large number of different micro- 
bial families, then the horizon is limitless. 

Thus, the final point of emphasis is that the drug discovery research community 
should continue to investigate Nature's resources for leads to novel chemotypes that 
can then be modified and/or extended using the power of all types of modern chemical 
and biochemical techniques. 
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Summary 

Taxus brevifolia is a Pacific yew tree that produces Taxol. Catharanthus roseus is one of the 
most well-studied plants in the world, and has provided a large number of alkaloids, including the 
clinically important antitumor agents vinblastine and vincristine. Both classes of antitumor agents 
interfere with tubulin-microtubule dynamics through opposite modes of action. They are the most 
renowned natural products, have been successfully brought to market, and have also served as leads 
for further optimization. This review highlights the chemical modification of Taxus diterpenoids and 
bisindole Catharanthus alkaloids, with an emphasis on structure-activity relationship studies and 
analog optimizations, leading to the discovery of a new generation of antitumor drugs. 

Key Words: Antitumor agents; Taxus brevifolia; Catharanthus roseus; tubulin; microtubules; 
taxol; bisindole alkaloids; vinblastine; vincristine; structure-activity relationship. 

1. Introduction 

Cancers and related diseases are a major cause of human death worldwide. With the 
development of molecular biology and pharmacology, great progress has been made 
toward a better understanding of these diseases. Among current cancer treatment options, 
only chemotherapy has effectively treated systemic cancers. Such chemotherapy relies 
heavily on the use of natural products and their modified analogs. Among the most 
effective natural products for cancer chemotherapy, vinblastine and paclitaxel (Taxol®) 
are well renowned. They have been successfully brought to market and have also served 
as lead compounds for further optimization. 

Microtubules are dynamic and polymeric structures assembled from two structur- 
ally similar protein subunits, a- and (3-tubulin. Each tubulin contains approx 440 amino 
acid residues. Microtubules play an important role in cellular activities by providing 
pathways for cellular transport processes, constructing the cell cytoskeleton, and they 
are also involved in mitosis. Because cancer cells proliferate more rapidly than normal 
cells, disrupting the cell-division process to stop the growth of diseased cells is clearly 
an important goal and the mode of action for many antitumor agents used in chemo- 
therapy (I). 
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Fig. 1. Selected microtubule stabilizing agents. 



Mechanistically, there are two ways of disrupting the tubulin-microtubule equilib- 
rium: (1) treating with agents capable of promoting tubulin polymerization and thus 
stabilizing microtubules against depolymerization, and (2) treating with agents capable 
of preventing tubulin polymerization, thus preventing the formation of microtubules. 

Several representative tubulin polymerization promoters (type 1 compounds) are 
shown in Fig. 1. These include Taxol, epothilones A and B, eleutherobin, sarcodictyin 
A, peloruside, laulimalide, and discodermolide (2). 

A number of bisindole alkaloids derived from Catharanthus roseus, shown in Fig. 2, 
were found to inhibit the tubulin polymerization process (type 2 compounds). 
Catharanthus roseus (L.) G. Don (Apocynaceae), also commonly known as rosy peri- 
winkle, is one of the most thoroughly studied plants in the world. It is indigenous to 
Madagascar and is cultivated in India, Israel, and the United States (3). To date, more 
than 100 alkaloids have been isolated from the leaves and roots of this plant, including 
an extensive series of bisindole alkaloids (4). Of all these, vinblastine and vincristine 
have become clinically important antitumor agents, marketed as Velban® and 
Oncovin®, respectively. Vinblastine was first discovered in 1958 by Noble at the Uni- 
versity of Western Ontario (5), and independently by Svoboda at the Lilly Research 
Laboratories (6), as an antiproliferative factor in leaf extracts of the periwinkle plant. 
Further purification of those extracts led to the discovery of vincristine (leurocristine), 
leurosidine, and leurosine (7,8). 
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Fig. 2. Selected bisindole alkaloids from Catharanthus roseus. 



Other well-known tubulin polymerization inhibitors, shown in Fig. 3, include 
maytansine, rhizoxin, cryptophycin 1, ustiloxin A, colchicine, and podophyllotoxine 
(9). Vinblastine and vincristine block the cell cycle at the G2/M phase. They exert their 
cytotoxic effect by interfering with the subtle equilibrium between tubulin and micro- 
tubules: vinblastine and vincristine form complexes with tubulin at a site different from 
the site to which colchicine binds, and prevent tubulin polymerization, thus leading to 
the inhibition of microtubule formation and thereby blocking mitosis (10). 

2. Optimization and SAR Studies of Paclitaxel and Analogs From Taxus 
brevi folia 

Natural-product drug discovery has been a well-established process, in which micro- 
organisms possess the remarkable ability to produce drug-like small molecules. Many 
natural products reached the market without any chemical modifications, such as ste- 
roids, (3-lactams, and erythronolides. Indeed, the potential to hit a "home run" with a 
single discovery distinguishes natural products from all other sources of chemical 
diversity and fuels the ongoing efforts to discover new compounds. On the other 
hand, many natural products present important structural motifs and pharmacophores, 
which can be optimized to produce valuable drugs with improved properties (lla-c). 

Paclitaxel 1, an antimicrotubule agent initially isolated from the bark of Taxus 
brevifolia (lid), has garnered considerable attention in recent years due to its efficacy 
in the treatment of various types of cancer, including ovarian, breast, and lung carci- 
noma (12). In addition, paclitaxel has demonstrated promising activity against Kaposi 
sarcoma, bladder, prostate, esophageal, head and neck, cervical, and endometrial can- 
cers (13). Paclitaxel has a unique mechanism of action, which distinguishes it from 
other anticancer agents. The cytotoxic effects of paclitaxel are believed to arise from 
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its ability to promote tubulin polymerization and stabilize microtubules thus formed 
even in the absence of cofactors such as GTP. A major consequence of this shift in 
tubulin-microtubule equilibrium is the inhibition of mitosis (14). The continually 
expanding therapeutic profile of paclitaxel, coupled with its novel mode of action, 
has spurred intense research activity on many fronts, including structure-activity rela- 
tionship (SAR) studies (15). A semisynthetic side-chain analog of paclitaxel, docetaxel 
(Fig. 4, compound 20) was approved by the FDA in 1996 for the treatment of advanced 
breast cancers (16). Although both paclitaxel and docetaxel have demonstrated impres- 
sive antitumor activity, recent reports have shown that the use of these drugs often 
results in undesirable side effects as well as various drug resistance (17). Therefore, it 
is necessary to develop better taxanes with improved efficacy and safety profiles against 
paclitaxel-sensitive and -resistant tumors. With these considerations in mind, we have 
been involved in an analoging program aimed at discovering better taxanes via both 
core and side-chain modifications. Results emerging from rather extensive SAR stud- 
ies at the diterpenoid core clearly show that the functional groups at the northern region 
of the molecule, namely C-7 (18), C-9 (19), and C-10 (20) substituents, contribute 
relatively little to receptor binding, whereas functionalities at the southern region of 
paclitaxel, namely C-2 (21) and C-4 (22) together with oxetane ring (23) and C-l 
(23b,24) are essential elements for tubulin binding and cytotoxicity. 

As a result of fruitful research by scientists at Bristol-Myers Squibb, two core-modi- 
fied paclitaxel analogs— BMS- 184476 (25) and BMS- 188797 (26)— were selected, on 
the basis of their impressive in vitro and in vivo antitumor activities, for human trials 
against various paclitaxel-resistant tumors. In addition, a C-14 hydroxylated paclitaxel 
analog, IDN5109 (SB-T-101131) (27), with additional modifications on its side chain, 
was selected by Bayer Corp. for human clinical trials (Fig. 4). More recently, Iimura et 
al. described an orally active taxotere analog containing morpholinoethyl moiety at the 
C-10 position (28). 
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Fig. 4. Other clinically important taxol derivatives 



In this chapter, we will highlight the effects of SAR modifications at C-2, C-4, and 
oxetane (D-ring) on tubulin polymerization as well as on cytotoxicity determined in 
various taxol-sensitive and -resistant cell lines. In vivo antitumor activities of a few 
selected compounds will also be discussed. 

2. 1. C-2 SAR Investigation 

2.1.1. General Synthetic Route 

A convergent synthetic route for the preparation of C-2 modified taxol analogs was 
reported by Kingston et al. (21d). As outlined in Structure 1, C-2 & C-7 bis-silylated 
taxol (compound 24) was subjected to a phase-transfer catalyzed debenzoylation con- 
dition to yield the corresponding C-2 hydroxyl derivative (compound 25). This inter- 
mediate was then condensed with the requisite C-2 acids to provide the respective C-2 
ester derivatives (compound 26), which were converted to the final products (com- 
pound 27) upon desilylation. 

2.1.2. C-2 SAR Trends 

In 1993, Chen et al. reported that the C-2 deoxytaxol (compound 27 f) was found to 
be > 100-fold less cytotoxic than Taxol against HCT-116 cell lines (a human colon 
cancer cell line), suggesting that the C-2 benzoate was involved in the intimate binding 
of taxol with its receptor (21a). Subsequent data from Chen et al. (21b) and Kingston et 
al. (21c,d) reported on the cytotoxicity values (IC 50 nM in HCT-1 16) determined for a 
set of C-2-substituted benzoates— compounds 27a (>88), 27b (0.70), 27c (3.8), 27d 
(0.36), and 27e (2.8). Compared with the cytotoxicity obtained with paclitaxel (IC 50 2- 
2.5 nM), compounds 27b and 27d were found to be more potent than paclitaxel. In view 
of the data obtained with compound 27a, it is clear that para-substitution was not toler- 
ated. A separate report from Georg et al. demonstrated that, on the basis of cytotoxicity 
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determined in the B16 melanoma cell line, the C-2 phenyl-bearing analog 27g was 
about fivefold more potent than its corresponding C-2 saturated counterpart 27h (lie). 
This result was in good agreement with that disclosed by Ojima and his collaborators 
on the corresponding taxol analogs (21f). More recently, Kadow and Chen prepared a 
series of structurally unique C-l/C-2 orthoester derivatives, including compound 27i 
shown in Fig. 5. Interestingly, this novel analog retained good cytotoxicity against 
HCT-116, with anIC 50 value of 6 nM (21g). It is also worthwhile to mention that the C- 
2 epi-taxol 27j was devoid of bioactivity in both the tubulin-binding assay and cytotox- 
icity assay when tested against the HCT-116 cell line (21h). 

More recently, Fang et al. showed that the C-2 benzamide bearing derivative 27k 
was 16-fold less cytotoxic than paclitaxel in KB cells (21i). These recent results pro- 
vide further evidence for the importance of the characteristics and stereochemistry of 
the C-2 functionality for the bioactivity of paclitaxel. 

Encouraged by the excellent potencies detected in the paclitaxel-sensitive cell line 
HCT-1 16, a number of promising C-2 analogs were further evaluated in the paclitaxel- 
resistant cell line 1A9(PTX22). This cell line expresses an altered (3-tubulin that con- 
fers 21 -fold resistance to paclitaxel. When tested against the 1A9(PTX22) cell line, 
compounds 27c and 27d showed 15 -fold cross-resistance relative to the corresponding 
sensitive cell line. In contrast to paclitaxel, the C-2 meta-azidobenzoate derivative 27e 
encountered only fivefold resistance (21d). 

2.2. C-4 SAR Studies 

2.2.1. General Synthetic Route for C-4 Derivatization 

As shown in Structure 2, two novel series of C-4 ester and carbonate paclitaxel 
analogs were synthesized from the key intermediate, 4-deacetyl baccatin derivative 30, 
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which was in turn prepared according to Chen's protocol (Red-Al/THF/0°C) in two 
steps from 7,13-bisTES baccatin III (compound 28) (22a). Deprotonation of compound 
30 with lithium bis(trimethylsilyl)amide, followed by reaction with an acyl chloride or 
a chloroformate, afforded the desired C-4 ester baccatin (compound 31) or the C-4 
carbonate baccatin (compound 32) in 42-94% yield. These two intermediates were 
converted, via desilylation (at C-l, C-7 and C-13) and subsequent mono-resilylation, 
to their corresponding 7-TES baccatin derivatives 33 and 34. Final side-chain attach- 
ment (at C-13) onto 33 or 34 led to two novel series of C-4-modified paclitaxel ana- 
logs, compounds 36 and 37, respectively (22a). 

2.2.2. C-4 SAP, Trends 

As shown in Table 1, the C-4 deacetyl derivative 38 was considerably less potent 
than paclitaxel in the tubulin polymerization and the cytotoxicity assays (22b, e). In 
1996, Chen et al. reported that the C-4 methyl ether analog 39 displayed 18-fold weaker 
cytotoxicity in comparison to paclitaxel (22 d). These results clearly show that the C-4 
acyl moiety is an important element in receptor binding. 

To further explore the size of the C-4 binding pocket, Chen and his collaborators 
synthesized and evaluated a large number of C-4 analogs, including various C-4 esters 
and C-4 carbonates highlighted in Table 1. Interestingly, two C-4 benzoyl bearing 
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derivatives (compounds 36a and 36b) were significantly less active than paclitaxel, 
suggesting that the binding pocket at C-4 is perhaps limited to 6-carbon alkyls (22b). 
In sharp contrast to the results mentioned above, all of the aliphatic esters 36c-h 
exhibited good to excellent activities in the tubulin and the cytotoxicity assays. A rather 
important trend was observed simply by comparing the cytotoxicity data of the follow- 
ing five C-4 straight chain (two to six carbons) ester analogs: 1 (paclitaxel), 36c, 36d, 
36f, and 36h. It is obvious that the 4-butyrate ester 36d was the most potent analog 
within that subset. This trend seemed to suggest that the four-carbon chain is probably 
the optimal size for effective receptor binding. In order to further optimize activity, 
other four-carbon-bearing C-4 esters such as C-4 cyclopropyl ester analog 36e were pre- 
pared. The compound 36e was found to be the most potent ester derivative in Table 1 
(22a,b). Interestingly, the C-4 aziridine carbamate derivative 40, a closely related ana- 
log of the C-4 ester 36e, was found to be 10-fold less potent than 36e in the tubulin 
polymerization assay (22c). As also documented in Table 1, two C-4 carbonate ana- 
logs, 23 and 37b, exhibited two- to fourfold improved potency relative to paclitaxel in 
both the tubulin binding assay and the cytotoxicity assay against HCT-1 16 human colon 
carcinoma cell line (22b). 
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2.2.3. Side-Chain Modified Novel C-4 Esters and Carbonates 

In light of recent reports indicating that replacement of the 3'C-phenyl and 3'N-Bz 
with their respective 3'C-furyl and 3'N-Boc could lead to paclitaxel side-chain analogs 
possessing enhanced in vitro activity (29), several potent C-4 esters (36d and 36e) and 
carbonates (23 and 37b) (Fig. 6) were further derivatized into their side-chain analogs 
for biological evaluation. The in vitro cytotoxicity against HCT-116 and HCT- 
1 16(VM46) and in vivo efficacy (ip) of these analogs are summarized in Table 2. 
2.2.3.1 . In Vitro Evaluation Against Sensitive and Resistant HCT-1 1 6 Cells 

The general conclusion that emerges from the in vitro data is that side-chain modifi- 
cations do not seem to have a significant impact on the cytotoxicity of these C-4 esters 
and C-4 carbonates as listed in Table 2. In fact, none of the 3'-C and/or 3'-N replace- 
ments gained a clear benefit over the corresponding parent compounds. For example, 
four pairs of C-4 cyclopropyl esters (compounds 36e, 42-44) and C-4 methyl carbon- 
ates (compounds 23, 45), regardless of their side-chain substituents, displayed almost 
equal potencies in the cytotoxicity assay against the HCT-1 16 cell line (22b). 

Careful analysis of the R/S ratio listed in Table 2 clearly indicates a moderate increase 
in size at C-4 resulted in an improved ability to overcome resistance in the HCT- 
1 16(VM46) cell line. This cell line is more than 100-fold more resistant to paclitaxel 
due to overexpression of P-glycoprotein (Pgp), which results in decreased intracellular 
concentrations of paclitaxel and leads to the multidrug-resistance phenotype (MDR). 
Although clinical resistance for paclitaxel is not well understood, Pgp overexpression 
and altered tubulin have been suggested as two possible mechanisms. Remarkably, as 
shown in Table 2, replacement of the C-4 acetoxy moiety in paclitaxel with either 
cyclopropyl ester (2.4-fold for 36e) or ethyl carbonate (2.9-fold for 37b) effectively 
overcame Pgp-mediated resistance to paclitaxel. In these two cases, further side-chain 
modifications did not impact the R/S ratio very much (see data obtained for compounds 
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Table 1 

In Vitro Potencies of Paclitaxel Side-Chain Bearing C-4 Analogs 
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* Ratio obtained in the tubulin polymerization assay, and ratios < 1.0 signify ana- 
logs more potent than paclitaxel. 



A7.-\A and 46 for detail). On the other hand, for other C-4 analogs 36d and 23, such 
side-chain modifications (t-Boc as the 3'-N cap and/or 2-furyl as the 3'-C replacement) 
had a significant impact on the R/S ratio. The resulting nonpaclitaxel side-chain ana- 
logs 41 and 45 exhibited impressive R/S ratios, ranging from 0.5 to 3.6 (26). 

2.2.3.2. In Vivo Evaluation (M-109 Lung Carcinoma) 

As shown in Table 2, of the 10 C-4 esters and carbonates tested against murine M-109 
lung carcinoma (22b,26c), 7 of them were active (T/C > 125%). The best results were 
obtained with the paclitaxel side chain bearing 4-cyclopropyl ester 36e and one 
nonpaclitaxel side chain containing 4-methylcarbonate 45, with T/C (%) values of 188 
and 1 83, respectively. It is also worthwhile to note that some of these C-4 analogs were 
10-15 times more potent than paclitaxel (as indicated by the dose mg/kg values). These 
include two C-4 cyclopropyl esters (36e, 43) and two C-4 ethyl carbonate derivatives 
(37b, 46). However, attempts to improve the in vivo efficacy by means of further side- 
chain modifications were not successful. For instance, 3'C-furyl/3'-N-Boc bearing C-4 
cyclopropyl analogs of 42-44 and 41 possessed reduced activities (as measured by T/C 
values) as compared with their corresponding parent analogs 36e and 36d. In one case, 
C-4 methyl carbonate 45 exhibited slightly better in vivo activity than the paclitaxel 
side-chain-bearing analog 23. No side-chain effect was seen with C-4 ethyl carbonate 
analogs (37b vs 46). In view of these discrepancies, it is fair to say that side-chain 
modification provided at best only minimal improvements in in vivo efficacy with the 
M-109 (ip) tumor model (22b,26c). 

2.2.3.3. Further In Vivo Evaluations 

Promising C-4 esters and carbonates with demonstrated in vivo efficacy in the M-109 
solid tumor model (ip) were tested in one or more secondary distal tumor models (iv 
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administration of drug to a mouse bearing a subcutaneously implanted tumor) in an 
effort to identify improved paclitaxel(s). To our disappointment, none of these com- 
pounds were found to be more efficacious than paclitaxel. It became apparent in the C-4 
analog series that an enhanced ability to overcome MDR resistance in vitro did not corre- 
late with expected enhanced in vivo efficacy. This observation parallels the results of 
Nicolaou's group obtained with C-2 taxoids (29e), as well as our own findings with 
C-7 paclitaxel derivatives (18). Despite limitations seen with the initial ip/ip M-109 
tumor model, the C-4 cyclopropyl ester 36e and the C-4 methyl carbonate derivative 
45 demonstrated the best activities. In light of the fact that furyl/Boc side-chain ana- 
logs in the C-4 cyclopropyl series (42-44) were inactive in vivo, this suggested that the 
in vivo activity of the C-4 methyl carbonate series might be more robust in tolerating 
the incorporation of a more potent side chain. To confirm this hypothesis, we decided 
to evaluate C-4 methylcarbonate 23 (BMS-1 88797) in distal tumor models (against 
L2987 lung carcinoma and HCT/pk colon carcinoma), despite its modest ability to 
overcome resistance in vitro. To our satisfaction, BMS-188797 demonstrated better 
efficacy than paclitaxel in these distal tumor models after iv administration (30). On 
the basis of these data, BMS-188797 was selected for further evaluation in phase I and 
II clinical trials. 

2.3. Oxetane Ring Modification 

In order to probe the importance of the intact oxetane ring for bioactivity, several 
oxetane ring opened analogs (e.g., compounds 47 and 48) as well as oxetane ring oxy- 
gen modified analogs (e.g., compounds 49-52) were prepared and evaluated for their 
tubulin assembly activity and in vitro cytotoxicity against cancer cell lines (see Fig. 7). 
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Fig. 7. Oxetane ring-modified taxol or taxotere derivatives. 



Pioneering work by Kingston showed that D-secotaxol derivatives such as com- 
pound 47 were devoid of biological activity (23a). On the basis of this result, it was 
concluded that the oxetane ring is essential for interaction with microtubules and hence 
cytotoxicity. Since the key C-4 acetate moiety was absent in compound 47, further 
research continued to search for compounds bearing both the intact C-4 functionality 
as well as a modified oxetane ring. In 1993, Chen and his collaborators prepared an 
interesting bicyclic D-ring containing analog 48 with the intact C-4 acetate, as seen in 
paclitaxel. Surprisingly, compound 48 also showed significantly reduced bioactivity 
relative to paclitaxel (23b). The lack of bioactivity observed with compound 48 seemed 
to suggest that the oxetane ring is required for activity. A recent report from Dubois et 
al. detailed synthesis and evaluation of two 5(20)-aza-docetaxel (taxotere). When tested 
against the KB cell line in vitro, both compounds 49 and 50 were found to be inactive. 
In addition, compound 50 showed 16-fold weaker potency than taxotere, and com- 
pound 49 was essentially inactive in the tubulin polymerization assay (23c). The nitro- 
gen atom in the azetidine ring (D-ring) would be protonated at neutral pH and thus 
would possibly interact with tubulin at a different site from that found with the neutral 
oxygen atom in taxotere. For these reasons, the role of the oxetane ring on the activity 
of paclitaxel or docetaxel was still not well understood. In order to pinpoint the role of 
the oxetane ring oxygen atom on activity, Kingston and his collaborators embarked on 
the synthesis and comparative evaluation of 5,20-thiapaclitaxel derivative 52, with its 
corresponding C-4 methyl carbonate paclitaxel analog 51. When tested in the tubulin 
polymerization and cytotoxicity assays, compound 52 had negligible activity in all 
assays, with IC 50 values >1 \iM (Burkitt lymphoma) and >2.5 uM (prostate carcinoma). 
In contrast, compound 51 was clearly superior in activity to paclitaxel and even 
docetaxel. The IC 50 values detected with compound 51 were 2 nM (CA46, Burkitt 
lymphoma), 1 nM (PC3 prostate carcinoma), and 0.2 nM (MCF-7, breast carcinoma). 
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Table 3 

Biological Activity of Bisindole Alkaloids in the CRFF-CEM 

T-Cell Leukemia Cell Line 

IC 50 IC 5( ) 

Compds (pg/mL) Compds (p.g/mL) 

vincristine 0.025 vinrosidine 1.0 

vinblastine 0.1 vinamidine 7.7 

vinleurosine 0.4 



On the basis of these results, Kingston concluded that the D-ring region of the 
pharmacophore of paclitaxel is very sensitive to steric effects, and the oxygen atom in 
the oxetane ring is acting as a hydrogen-bond acceptor (23d). It is worthwhile to point 
out that the similar 5(20)-thiadocetaxel was later prepared by Potier and his co-workers 
(23e). 

3. Optimization and SAR Studies for Designed Bisindole Catharanthus 
Alkaloid Analogs 

For several decades, vinblastine and vincristine have become clinically important 
antitumor agents due to their strong antineoplastic activity against a wide spectrum of 
human tumors (31 ) (Table 3). These two alkaloids, although structurally almost iden- 
tical, differ markedly in the type of tumors that they affect and in their toxic propertities. 
In particular, vinblastine is used in the treatment of Hodgkin's disease, whereas vinc- 
ristine is considered the drug of choice in treating acute childhood leukemia, and is 
more widely used (32). 

Vinblastine and vincristine are highly potent drugs, which also have serious side 
effects, especially on the neurological system (33). Therefore, development of new 
synthetic analogs with the goal of obtaining more effective and less toxic drugs is 
desirable (34). This long-standing interest has led to worldwide research programs that 
continue today, to investigate structural modifications of these alkaloids. Research has 
mainly focused on the partial or total synthesis of vinblastine analogs from the two 
parts of the dimeric structure, the Iboga and Aspidosperma portions, by carbon skel- 
eton modification and functional group transformation. To date, these efforts have led 
to more than a dozen candidates in clinical evaluation. Optimization and SAR studies 
of vinblastine structure will continue for the foreseeable future, in order to obtain new 
generations of effective antitumor drugs. 

3.1. SAR Studies on the Velbanamine Portion: Discovery of Vinoreibine 
and Vinflunine 

Although the upper moiety of vinblastine has been less studied due to its lack of 
functionalities, it is still considered to be a very important region in terms of the 
potency and novelty of analogs. Modification of the upper velbenamine moiety via 
the rearrangement of catharanthine is less easily accessible and would require many 
steps. However, several SAR observations can be deduced from the biological evalua- 
tion of various analogs that have been prepared (35). 
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Structure 3. Potiers synthesis of vinorelbine via second modified Polonovski reaction. 



The stereochemical configurations at C16' and C14' in the velbanamine portion are 
critical. Inversion of the configurations leads to loss of activity. The CI 6' carbomethoxy 
group of the velbanamine portion is also important, since the decarboxylated dimer is 
inactive. 

Structural variation at C15'-C20' of the velbanamine portion is well tolerated. 
Leurosidine (the C20' epimer), leurosine (the epoxide), the C20'-deoxy derivative, the 
C15'-C20' dehydro-derivative, and C20'-desethyl derivative, all exhibit different inhi- 
bition activities for microtubule assembly (36). 

Several A^4'-oxides have been prepared by oxidation of the parent compounds with 
peroxide. They have shown significantly reduced antitumor activity as well as reduced 
tubulin-binding affinity. Other skeletal modifications, such as removal of the D-ring of 
vinblastine, have been reported. One example, vinamidine (also known as catharinine), 
with the D-ring cleavage, has resulted in a near complete loss of biological activity. 

One of the most successful examples of upper-region modified vinblastine analogs is 
vinorelbine (compound 57), which retains excellent activity but is less neurotoxic than 
vincristine. It was prepared by a second Potier-Polonovski reaction with anhydrovinblastine 
(compound 53). As shown in Structure 3, the resulted bisiminium species was hydrolyzed 
and recyclized to provide skeleton-modified vinorelbine (37-39). 

The antitumor activity of navelbine has been evaluated in many murine tumor mod- 
els. Results have shown that it demonstrated remarkable activity against leukemia 
models, although it was inactive in carcinoma and fibrosarcoma models. Vinorelbine is 
currently available worldwide for the treatment of non-small cell lung cancer and of 
breast cancer. It is orally active, and its use will almost certainly be extended to the 
treatment of other cancerous diseases. 

Super acid chemistry has been applied to prepare vinflunine (compound 58, Struc- 
ture 4), the difluoro-derivative of vinorelbine. This compound has improved in vivo 
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antitumor activity over vinorelbine against B 16 melanoma and a panel of human tumor 
xenografts. It is currently undergoing phase I clinical evaluation (40). In a similar fash- 
ion, a new family of anhydrovinblastine and vinorelbine analogs was prepared using 
super acid chemistry. Halogen atoms, ketone, and hydroxy groups are introduced into 
the upper velbanamine portion (41). 

It was discovered that fluorination at CI 9' position of vinorelbine dramatically 
increases the in vivo activity, while hydroxylation at the same position results in the 
total loss of activity. In the anhydrovinblastine series, monohalogenation of the C19' 
position does not affect the activity, but dihalogenation significantly decreases the ac- 
tivity. In contrast to the above observation, the same modifications greatly decrease the 
in vivo activity for the vinorelbine series. In addition to the above findings, deacetylation 
at the C17 position of dihydrovinorelbine and vinflunine resulted in increases of both in 
vitro cytotoxicity and in vivo activity. 

Another strategy to synthesize new upper-region-modified vinblastine analogs has 
been established as shown in Structure 5 (42). The preliminary SAR results suggested 
that the integrity of the velbanamine skeleton in the bisindole alkaloids is important for 
maintaining their potency. Only one of the tetrahydro-isoquinoline derivatives (com- 
pound 64) exerted a marginal activity on the tubulin polymerization inhibition test. 
Other derivatives lacking the C5'-C6' bond in the C-ring were found to be inactive. 

3.2. SAR Studies on the Vindoline Portion 

Most of the SAR studies involve the vindoline portion of bisindole alkaloids. This is 
due to the fact that the presence of various functionalities, such as the C16 and C17 
positions, offers good opportunities for derivatizing new analogs. Many functional 
groups at C16 have been found to be equivalent to the ester "pharmacophore" at this 
position. These findings have led to the discovery of several new generations of ana- 
logs such as vindesine and vinzolidine. 

3.2.1. Vindoline's Indole Methyl Derivatives 

Vinepidine (LY 1 19863) (compound 66) is a 20'-deoxy epimer of vincristine (43). 
As shown in Structure 6, it can be prepared from the 20'-e/?/-deoxyvinblastine by 
oxidation of the vindoline portion's methyl group. In terms of antitumor activity, 
vinepidine was more efficient than vincristine against lymphosarcoma and leukemia. 
Although its biochemical pharmacology, antitumor efficacy, and potency led to a phase 
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I clinical study, vinepidine' s observed neuromuscular toxicity prevented its further 
clinical development (44,35). 

Vinformide (compound 68), also known as A^-formylleurosine, is an A^-formyl ana- 
log of leurosine. It was prepared from leurosine by oxidation of the vindoline portion's 
methyl group with various oxidants, including chromic acid (45). Another synthesis 
started from vincristine reacted with Vilsmeier reagent to produce anhydro vincristine, 
which was later epoxided to yield vinformide (46). Vinformide was found to be 20 
(P388 cell) or 1000 (K562 cell) times less toxic than vincristine, and is useful in the 
treatment of lymphoma, leukemia, and Hodgkin's disease. On the other hand, in spite 
of its promising antineoplastic potency, vinformide also exerts an acute cardiotoxic 
side effect (47). 

Other N\ derivatives have been reported, and they are shown in Structure 7. Many 
of them have demonstrated certain antitumor activities as well as improved therapeutic 
index compared with vinblastine. This finding indicates that vindoline' s indole methyl 
group is a useful position to functionalize potentially and provide more potent vinblas- 
tine derivatives (48). 

3.2.2. Analogs Derived From the C16 Carbomethoxy Group: Discovery 
of Vindesine and Analogs 

Vindesine (LY 99094) (compound 71) is a chemically modified vinblastine analog 
that differs slightly from vinblastine by having an amide group instead of an ester 
group at CI 6. Such minor differences are responsible for the profound alteration in the 
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antitumor spectrum, potency, and toxicity of these compounds. In particular, vindesine 
possesses an antitumor spectrum against rodent tumor systems that resembles vincris- 
tine, rather than its parent vinblastine, while its neurotoxic potential appears to be less 
than that of vincristine. Vindesine is prepared from vinblastine or deacetylvinblastine 
by various methods, as shown in Structure 8 (49). 

The selection of vindesine for clinical evaluation provided an opportunity to explore 
the consequences of minor structural changes within the "bottom" vindoline portion. 
Clinical trial data indicated that vindesine is an active oncolytic agent that appears to 
be less neurotoxic than vincristine. Further studies also indicated that vindesine is active 
in vincristine-resistant childhood leukemia. When used in combination with cisplatin, 
vindesine showed efficacy in treatment of non-small cell lung cancer. It is currently 
available for the treatment of leukemia in several countries. 

An extensive series of ,/V-substituted vindesine analogs has been prepared from the 
reaction of deacetyl-vinblastine acid azide (compound 72, Structure 9) with the appro- 
priate amines. These Af-alkylvindesines have reduced activity. In terms of collective 
antitumor activity, vindesine emerges as the congener with optimum qualities (50). 



186 



Hong and Chen 



vinblastine 




2. HNO 



Structure 9. Preparation of vindesine analogs (/V-alkylvindesines). 



R'\,C0 2 Et 




72 



OH 

i H CON 3 
Me J 



NH 2 75 




76a R = -L-Ile-OEt 

76b R = -L-Trp-OEt (vintriptol) 



Structure 10. Synthesis of vinblastin-16-oyl amino acid derivatives. 



Similarly, a group of 21 vinblastine-C16-carboxyl amino acid derivatives (com- 
pound 76, Structure 10) were synthesized by linking amino acid carboxylic esters to 
the vinblastin-C16-carboxyl moiety through an amide linkage. The influence of the na- 
ture of the amino acid, ester alkyl chain lengths, the stereoisomerism of the amino acid, 
and the reacetylation of the hydroxyl group at the C17 position of the vindoline moiety 
have been studied. Among those 21 congeners, vinblastine-L-Trp-OEt (compound 76b) 
and vinblastine-L-Ile-OEt (compound 76a) stand out as the best derivatives, based on 
their antitumor activities. Further studies also showed that the presence of a tryptophan 
at the C16-carboxyl moiety diminished the toxicity. Therefore derivative vinblastine- 
L-Trp-Oet (compound 76b), known as vintriptol, was selected for phase I and II clinical 
trials (51,52). 

A series of new amino phosphonic acide derivatives of vinblastine has been synthe- 
sized and tested in vitro and in vivo for antitumor activity. All of these compounds 
were capable of inhibiting tubulin polymerization in vitro. The antitumor activity 
strongly depended on the stereochemistry of the phosphonates. The most active com- 
pounds have the ^-configuration. One of them, a compound with an isopropyl group 
(R = z'-Pr) (compound 78), exhibited a remarkable activity against cancer cell lines both 
in vitro and in vivo, and was brought into phase I clinical trials (53). 

Cephalosporin substituted at the C3' position with the potent oncolytic agent 
desacetylvinblastine hydrazide was synthesized as a potential prodrug for the treat- 
ment of solid tumors. The design of this novel prodrug was based on the knowledge 
that hydrolysis of a cephalosporin's (3-lactam bond can result in the expulsion of the 
C3' substituent. As shown in Structure 12, treatment of these candidate prodrugs (com- 
pounds 79-81) with the P99 (3-lactamase enzyme efficiently catalyzed their conversion 
to the free drug form (54). 
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Napavin (compound 84, Structure 13), a photo-reactive vinblastine derivative, was 
synthesized from vinblastine. In contrast to vinblastine, which binds noncovalently to 
the intracellular protein tubulin, napavin was designed to form a covalent bond with 
target proteins upon irradiation, to prevent its elimination and thus increase the half- 
life of its action. It was found that as new photo-reactive cytostatic substance, napavin 
can overcome multidrug resistance of tumor cells (55). 

3.2.3. The C16 Spiro-Fused Analogs: Discovery of Vinzolidine and Analogs 

Different from the vast majority of the C16 analogs of vinblastine, a group of C16 
spiro-fused derivatives represents a novel class of semisynthetic vinblastine analogs. 
In particular, the C16 spiro-fused oxazolidine-l,3-dione (compound 85a-c) can be 
prepared in two ways, as shown in Structure 14. A number of the bisindole 
oxazolidinedione analogs have been prepared using these methods. Their ability to 
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induce a mitotic block and their antitumor efficacy were examined. Vinzolidine (LY 
104208) (compound 85c) stood out as exceptionally active and well absorbed when 
administered by the oral route. These results have allowed vinzolidine to undergo fur- 
ther clinical studies (56,57). 

As shown in Structure 15, spirolactone (compound 88) was formed during the re- 
action of paraformaldehyde with 17-deacetylvinblastine under acidic conditions. Its 
chemotherapeutic activity was assessed for experimental P338 leukemia. The data 
showed that this compound had a much better activity than vinblastine (58). 

A new series of semisynthetic C16-spiro-oxazolidine-l,3-diones (compound 89a-c) 
were prepared from 17-deacetoxy-vinblastine using one of the standard methods. As 
shown in Table 4, it was found that compound 89b had lower toxicity than vinblastine 
and exerted different pharmacological effects. The spiro-oxazolidine ring and the sub- 
stitution of a formyl group for a methyl group were responsible for the unique phar- 
macological effects observed. The studies also showed that these compounds 
displayed their cytotoxic activities at significantly higher concentrations than parent 
compounds, although their antimicrotubular activities were similar in vitro (59). 

3.2.4. The C17 Hydroxyl Group-Derived Analogs and Prodrugs 

Vinglycinate (LY 49040) (compound 91) is a glycinate prodrug derived from the 
C17 hydroxy group of vinblastine. It was prepared from vinblastine as shown in Struc- 
ture 16. Vinglycinate demonstrated measurable activity against several leukemia mod- 
els and was orally absorbed. Its toxicity and the experimental antitumor spectrum were 
found to be very similar to that of vinblastine. As a result, vinglycinate was brought 
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Table 4 

Biological Activity of Vinzolidine Analogs in Various Tests 
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Structure 16. Synthesis of vinglycinate. 
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Table 5 

In Vitro Biological Activity of Several Bisindole Alkaloid Analogs 
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into clinical studies. Further clinical development was stopped because it showed no 
improvement over vinblastine in terms of efficacy and toxicity (60,35). 

In a fashion similar to the preparation of vinglycinate, a series of 17-(oc-aminoacyl) 
derivatives (compounds 92a-d) have been prepared. As shown in Table 5, the antitu- 
mor activity of these compounds was examined. It was shown that the larger size of the 
acyl substitutent at C17 reduced the antitumor activity significantly. Several analogs 
derived from the C17 hydroxyl group have been prepared, and the activities have been 
tested. The results in Table 3 showed the same trend (61). 

A series of peptidyl conjugates at the C17 position of vinblastine have been devel- 
oped for solving the marginally effective and dose-limiting systemic toxicity issues. 
One of them, compound 95, as shown in Structure 17, was evaluated as a prodrug 
targeted to prostate cancer cells. This compound features an octapeptide segment at- 
tached by an ester linkage at the C17 position of vinblastine, and undergoes rapid cleav- 
age by prostate-specific antigen (PSA) (T 1/2 = 12 min) between the Gin and Ser 
residues. In nude mouse xenograft studies, it reduced circulating PSA levels by 99% 
and tumor weight by 85% at a dose just below its MTD (62). 

4. Summary and Future Perspectives 

Since the first isolation and structural elucidation of Taxus diterpenoids and bisindole 
Catharanthus alkaloids, tremendous synthetic and medicinal chemistry efforts have 
been pursued and disclosed in the literature. Vinblastine and vincristine have served as 
lead structures for discovery chemistry and lead optimization. 
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Structure 17. Synthesis of peptidyl prodrugs of vinblastine. 



The discovery chemistry of the bisindole alkaloids and their biosynthetic precursors 
that occur in the same plant has culminated in the development of many novel efficient 
synthetic routes that are applicable for SAR studies and process development. All these 
efforts have enriched our knowledge in both synthetic organic chemistry and medicinal 
chemistry research areas. 

The original biosynthetic hypothesis and subsequent synthetic studies led to an 
understanding of the biosynthetic pathway, allowing us to couple both catharanthine 
and vindoline to access bisindole alkaloids. As a result of the discovery chemistry, 
many natural as well as semi-synthetic analogs of vinblastine were prepared via the 
biosynthetic concepts and newly developed methodologies. In addition to the above 
development, other areas of pharmacokinetics and pharmacology have been extensively 
explored, and several allied disciplines have been well integrated. For example, struc- 
tural-activity relationship studies have led to an understanding of various pharma- 
cophores present in the molecule. Medicinal chemistry also led us to probe the relevant 
biology, especially mode of action and tubulin-microtubule dynamics. New genera- 
tions of vinblastine analogs, represented by navelbine, vindesine, and vinzolidine, as 
well as vinflunine and many others, have been brought into clinical studies, and many 
of them have been successful in reaching the market. 

Future progress will focus on the necessity of developing a three-dimensional struc- 
ture of the vinblastine binding site, which will benefit our rational drug design. Such a 
ligand-receptor model will also allow us to probe the tubulin-microtubule dynamics at 
a molecular level. Meanwhile, a suitable screening model for clinically useful bisindole 
alkaloids needs to be defined. Other research horizons will arise from the development 
of bio-conjugates or their hybrids with bisindole alkaloids, as well as from our increas- 
ing knowledge of the molecular mechanisms of drug resistance. 

With the goal of identifying paclitaxel derivatives retaining in vivo efficacy against 
paclitaxel-resistant tumors, scientists from many pharmaceutical industries and aca- 
demic institutions synthesized and evaluated various paclitaxel analogs containing 
modifications at the core or/and side-chain regions. As a result of this intensive effort, 
BMS-184476 (25), BMS-188797 (26), and IDN5109 (27) were selected, on the basis 
of their excellent in vitro activity and in vivo activity in a number of animal models, for 
clinical evaluation against a number of solid tumors. The future success of these novel 
taxanes in clinical trials should provide clinicians with more treatment options for com- 
bating various paclitaxel-resistant cancers. In addition, the success of orally active 
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paclitaxel analogs in clinical trials should reduce the cost of chemotherapy by eliminat- 
ing the need for premedication, as required for paclitaxel treatment. 

In conclusion, the discovery of the Taxus and Catharanthus family of natural prod- 
ucts will continue to provide good opportunities to explore chemical biology, leading 
to the generation of new clinically useful drugs for the treatment of human cancers. 
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Summary 

Terpenoids, also referred to as terpenes, are the largest group of natural compounds. Many 
terpenes have biological activities and are used for the treatment of human diseases. The worldwide 
sales of terpene-based pharmaceuticals in 2002 were approximately US $12 billion. Among these 
pharmaceuticals, the anticancer drug Taxol® and the antimalarial drug Artimesinin are two of the most 
renowned terpene-based drugs. All terpenoids are synthesized from two five-carbon building blocks. 
Based on the number of the building blocks, terpenoids are commonly classified as monoterpenes 
(C 10 ), sesquiterpenes (C 15 ), diterpenes (C 20 ), and sesterterpenes (C 25 ). These terpenoids display a wide 
range of biological activities against cancer, malaria, inflammation, and a variety of infectious dis- 
eases (viral and bacterial). In last two decades, natural-product bioprospecting from the marine 
environment has resulted in hundreds of terpenoids with novel structures and interesting bioactivities, 
with more to be discovered in the future. The problem of supply is a serious obstacle to the develop- 
ment of most terpenoid compounds with interesting pharmaceutical properties. Although total chemi- 
cal synthesis plays a less important role in the production of some terpenoid drugs, it has contributed 
significantly to the development of terpenoid compounds and terpene-based drugs by providing 
critical information on structure-activity relationships (S AR) and chiral centers as well as generating 
analog libraries. Semisynthesis, on the other hand, has played a major role in the development and 
production of terpenoid-derived drugs. Metabolic engineering as an integrated bioengineering 
approach has made considerable progress to produce some terpenoids in plants and fermentable hosts. 
Cell culture and aquaculture will provide a solution for the supply issue of some valuable terpenes 
from terrestrial and marine environments, respectively. Recent advances in environmental genomics 
and other "-omics" technologies will facilitate isolation and discovery of new terpenoids from natural 
environments. There is no doubt that more terpenoid-based clinical drugs will become available and 
will play a more significant role in human disease treatment in the near future. 

Key Words: Natural products; bioactive terpenoids; biosynthesis; chemical synthesis; drug 
discovery and development; sustainable production. 

1. Introduction 

Natural products have played a significant role in human disease therapy and pre- 
vention (1). More than 60% and 75% of the chemotherapeutic drugs for cancer and 
infectious disease, respectively, are of natural origin (2). With more than 23,000 known 
compounds, terpenoids, also referred to as terpenes, are the largest class of natural 
products. Among this group, many interesting compounds are extensively applied in 
the industrial sector as flavors, fragrances, and spices, and are also used in perfumery 
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and cosmetics products and food additives (1,3,4). Many terpenes have biological activi- 
ties and are used for medical purposes. For example, the antimalarial drug Artemisinin 
and the anticancer drug paclitaxel (Taxol®) are two of a few terpenes with established 
medical applications. Natural products continue to be one of the most important sources 
of lead compounds for the pharmaceutical industry. At the same time, more terpenes 
have been discovered as efficacious compounds in human disease therapy and preven- 
tion. In particular, marine chemists and biologists have identified many marine terpe- 
nes with promising potential for medical applications. For instance, eleutherobin and 
sarcodictyin A from the Australian soft coral Eleutherobia sp. (5) and the stoloniferan 
coral Sarcodictyon roseum (6, 7), respectively, exhibit potent antitumor activity against 
a variety of tumor cells. Therefore, terpenoids presumably will play an increasingly 
important role as therapeutic and preventative agents for human diseases. In this chap- 
ter, we review the status of terpenoids as potential pharmaceutical agents, with an 
emphasis on monoterpenes, sesquiterpenes, diterpenes, and sesterterpenes. 

2. Biosynthesis of Terpenoids 

Terpenoids show enormous chemical and structural diversity. However, their back- 
bones are synthesized from only two five-carbon isomers: isopentenyl diphosphate 
(IPP, C 5 ) and its highly electrophilic isomer, dimethylallyl diphosphate (DMAPP, C 5 ). 
There are two known pathways for the biosynthesis of these two universal precursors 
(Fig. 1). Details on the progress of the elucidation of these two biosynthetic pathways 
are summarized in several excellent reviews (4,8-17). From these two basic building 
blocks, a group of enzymes called prenyltransferases can synthesize linear prenyl 
diphosphates, which serve as the precursors for terpenoid biosynthesis. During biosyn- 
thesis, the active isoprene unit (IPP) is repetitively added to DMAPP or a prenyl diphos- 
phate in sequential head-to-tail condensations catalyzed by the prenyltrans-ferases. The 
reaction starts with elimination of the diphosphate ion from an allylic diphosphate to 
form an allylic cation, which is attacked by the IPP molecule with stereospecific 
removal of a proton to form a new C-C bond and a new double bond in the product 
(Fig. 2) (18). Through consecutive condensations of IPP with an allylic prenyl diphos- 
phate, a prenyltransferase can synthesize a variety of products with fixed lengths and 
stereochemistry. The chain length of prenyl diphosphates ranges from geranyl diphos- 
phate (GPP, C 10 ) to natural rubber, whose carbon chain length extends to several mil- 
lion (18-20). All prenyltransferases require divalent metals such as Mg 2+ or Mn 2+ for 
catalysis. GPP synthase and farnesyl diphosphate (FPP, C 15 ) synthase catalyze conden- 
sation reactions with IPP and DMAPP to form GPP, a precursor to monoterpenes, and 
FPP, a precursor to sesquiterpenes, respectively (21,22). Geranylgeranyl diphosphate 
(GGPP, C 20 ) synthase (18,19) and farnesylgeranyl diphospate (FGPP, C 25 ) (23,24) syn- 
thase use the same condensation reactions to synthesize GGPP, a precursor to diterpene, 
and FGPP, a precursor to sesterterpene, respectively (Fig. 2). 

Terpene cyclases are responsible for the biosynthesis of the thousands of natural 
terpenoid compounds found in terrestrial and marine organisms. Terpenes are synthe- 
sized from lineal - prenyl diphosphates in the cyclization cascades mediated by terpe- 
noid cyclases (also known as terpene synthases or isoprenoid synthases), which are 
known to catalyze the most complex chemical reactions known in chemistry and biol- 
ogy (25-27). A terpenoid cyclase binds and chaperones a linear prenyl diphosphate 
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Fig. 1. Biosynthetic pathways of terpenes. 



substrate through a precise and multistep cyclization cascade that is initiated by the 
generation and propagation of highly reactive carbocationic intermediates, which 
readily undergo dramatic structural rearrangements. The cyclase controls the reactions 
and provides a template for cyclization and rearrangement with stereochemical and 
regiochemical precision (26-30). Over all, two-thirds of the substrate carbon atoms 
undergo changes in chemical bonding and/or hybridization to form a single, unique 
product (27). 

Variations in the number of isoprene unit repetitions, cyclization reactions, and rear- 
rangements are primarily responsible for the chemical and structural diversity of terpe- 
noids (31). Many structurally distinct monocyclic and bicyclic terpenes arise from 
cyclization and rearrangement of GPP; the larger precursors FPP, GGPP, and FGPP 
give rise to an even greater variety of terpene carbon skeletons. The variety of ways in 
which these linear and cyclic skeletons can be arranged, results in the incredible struc- 
tural diversity observed in nature. Many terpenes have carbacyclic ring systems and 
oxidized carbon chains such as alcohols and/or carbonyl groups. Some terpenoids have 
sugar moieties. The bicyclic and polycyclic ring systems with three- and four-mem- 
bered rings appear commonly in terpenes (9,13,32,33). Based on the number of five- 
carbon isoprene units in their linear precursor prenyl diphosphate, terpenoids are 
typically classified as C 5 hemiterpenes, C 10 monoterpenes, C 15 sesquiterpenes, C 2 n 
diterpenes, C 2 5 sesterterpenes, and C 30 triterpenes (3,20). 
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3. Pharmaceutical Terpenoids 

Terpenoids have a very broad range of biological activities. To review all biologi- 
cally active terpenoids would be a difficult task, owing to space constraints. In this 
review we will focus on terpenoids with activities against cancer, malaria, inflamma- 
tion, and a variety of infectious diseases (viral and bacterial). Other chemotherapeutic 
agents for these diseases have been the subject of many excellent reviews (34-71). 
Here, we discuss terpenoids with therapeutic and pharmaceutical functions against 
these diseases. 

3. 1. Monoterpenes 

Monoterpenes are best known as constituents of the essential oils, floral scents, and 
the defensive resins of aromatic plants (72,73). Many monoterpenes are nonnutritive 
dietary components found in the essential oils of citrus fruits, cherry, mint, and herbs. 
The formation of various monoterpenes from geranyl diphosphate is catalyzed by dif- 
ferent terpene cyclases. The general properties of monoterpenes have been discussed 
in several reviews (32,74-76). A number of dietary monoterpenes have antitumor 
activity, exhibiting not only the ability to prevent the formation or progression of 
cancer, but the ability to regress existing malignant tumors (77). 

Limonene (Fig. 3) is the most abundant monocyclic monoterpene found in nature, 
and it occurs in a variety of trees and herbs (e.g., Mentha spp.). It is a major constituent 
of peel oil from oranges, citrus, and lemons, and the essential oil of caraway. It has the 
same skeleton as found in a wide range of important flavor and medicinal compounds, 
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and consequently, limonene has been an interesting target molecule for chemists and 
biologists (78). Limonene is a well-established chemopreventive and therapeutic agent 
against many tumor cells (65,77,79). Carvone, a major monoterpene in caraway seed 
oil, has been shown to prevent chemically induced lung and forestomach carcinoma 
development (80). In addition, carveol has chemopreventive activity against rat mam- 
mary cancer during the initiation phase (81). Perillyl alcohol, a hydroxylated analog of 
limonene, exhibits chemopreventive activity against chemically induced liver cancer 
in rats (82) and tumor recurrences in animals (83). Furthermore, perillyl alcohol exhib- 
its chemotherapeutic activity against rat mammary tumors (83) and transplanted pan- 
creatic cancer in hamster, with a significant portion of treated tumors being completely 
regressed (84). Clinical trial testing of chemotherapeutic activity of limonene and 
perillyl alcohol is in phase I (85) and phase II (86,87), respectively. The mechanism of 
action of monoterpenes against tumor cells is the induction of apoptosis and interfer- 
ence of the protein prenylation of key regulatory proteins (77,79,88,89). 
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The pyrethrins represent a group of six closely related monoterpene esters and are 
valuable insecticidal components isolated from pyrethrum flowers, Chrysanthemum 
cinerariaefolium, and several other species in the Asteraceae family (90,91). Pyre- 
thrins are used for treatment of skin parasites such as head lice. They block sodium- 
channel repolarization of the arthropod neuron, leading to paralysis and death of 
parasites (43,92). However, minor side effects such as dry and scaly patches, edema, 
pruritus, and erythema have been reported when pyrethrins are applied in some forms 
(92). Permethrin, a synthetic analog of pyrethrin, is also used to treat head lice infesta- 
tion. However, the emerging resistance to pyrethrin and permethrin in head lice has 
become a serious concern (93). 

Among other halogenated acyclic monoterpenes, halomin, isolated from the red alga 
Portieria hornemnnii (94), is very effective against renal, brain, colon, and non-small 
cell lung cancer cell lines through a unique mode of action (95). However, further 
elucidation of its atypical biological activity has been hampered by the limited avail- 
ability of its natural source, P. hornemnnii (96). Finally, several acyclic monoterpe- 
nes — citronellol, nerol, and geraniol — exhibit some activity against Mycobacterium 
tuberculosis with MIC (minimal inhibition concentration) values of 64, 128, and 64 
ug/mL, respectively (97). 

3.2. Sesquiterpenes 

In general, sesquiterpenes are less volatile than monoterpenes. Among the sesquiter- 
penes, the sesquiterpene lactones are widely distributed in marine and terrestrial organ- 
isms and are well known for their wide variety of biological activities (98,99). The 
anti-inflammatory activities of some medicinal plants result from the presence of one 
or more sesquiterpene lactones. Feverfew has been used for at least two millennia for 
the treatment of fever, headaches, menstrual difficulties, and stomach aches (100). 
Today it is widely used for the relief of arthritis, migraine, asthma, and psoriasis (101- 
105). Parthenolide (Fig. 4), a sesquiterpene lactone, is responsible for the majority of 
the medicinal properties of this traditional herbal remedy. This sesquiterpene lactone 
can be found in several species (e.g., Tanacetum parthenium, C. parthenium, 
Leucanthemum parthenium, and Pyrethrum parthenium). 

Artemisinin, another sesquiterpene lactone, contains a rare endoperoxide bridge that 
is essential for its antimalarial activity (106). Artemisinin is derived from an ancient 
Chinese herbal remedy and has been isolated from Artemisia annua (sweet wormwood 
or "Qinghao"), a species of the Asteraceae family. This herbal plant has been used in 
Chinese herbal medicine for over 200 years. Artemisinin and its derivatives represent a 
very important new class of antimalarial drugs, and are used throughout the world. It is 
effective against both drug-resistant and cerebral malaria-causing strains of Plasmo- 
dium falciparum. As an antimalarial agent, artemisinin has been extensively reviewed 
by several researchers (35,39,56,107,108). Chamazulene, oc-Bisabolol, and bisabiolol 
oxides A and B, are terpenoids isolated from matricaria flowers (Matricaria chamolilla) 
and are commonly used in herbal medicine for the treatment of skin inflammation, and 
as antibacterial and antifungal agents (13). Chamazulene's anti-inflammatory activity 
is a result of blocking of leukotriene biosynthesis (109). 

Many plant sesquiterpenes have also been shown to be effective against the causal 
agent of tuberculosis (TB), M. tuberculosis, which infects approx 8 million people and 
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causes 2 million deaths each year. Tuberculosis is still the leading killer among all 
infectious disease, especially in developing countries. With the emergence of 
multidrug-resistant strains of M. tuberculosis, the search for new antituberculosis 
agents has become increasingly important (48). More than 50 sesquiterpenes from 
plants exhibit significant antituberculosis activity. Sesquiterpene lactones of the 
germacranolide (Fig. 5), guaianolide, and eudesmanolide types are shown to have 
antituberculosis activity, with MICs ranging from 2 ug/mL to >128 ug/mL. Details 
of the antituberculosis activity of these sesquiteperne lactones are discussed thor- 
oughly by Cantrell et al. (45). 

Many sesquiterpenes isolated from the marine environment also show activity 
against tuberculosis (50). Axisonitrile-3 (Fig. 6), a cyanosesquiterpene isolated from 
the sponge Acanthella klethra, is a potent inhibitor of M. tuberculosis, with an MIC of 
1.56 ug/mL (110). Puupehenone, 15-cyanopuupehenone, and 15-oc-cyanopuupehenol, 
isolated from sponges of the orders Verongida and Dictyoceratida (111-113), are 
natural sesquiterpene-shikimate-derived metabolites. Some of them have attracted 
significant attention from several research groups because of their cytotoxic, antimi- 
crobial, and immunomodulatory activities (113). Puupehenone, 15-cyanopuupehen- 
one, and 15-oc-cyanopuupehenol demonstrated 99, 90, and 96% inhibition of M. 
tuberculosis (H3Rv) growth, respectively, at MICs of 12.5 ug/mL. It has been shown 
that the quinine-methide system in ring D of puupehenone is essential for its inhibi- 
tory activity (50). 

Sesquiterpene quinines and hydroxyl quoinones and their related compounds repre- 
sent a prominent class of biologically active metabolites (114). Their occurrence seems 
to be restricted to sponges of the three families Spongiidae, Thorectidae, and 
Dysideidae of the order Dictyoceratida, to the family Niphatidae of the order 
Haplosclerida, and to the algal species Dictyopteris undulata (115). Their remarkable 
biological activities include antibacterial, cytotoxic, anti-inflammatory, anti-human 
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immunodeficiency virus (HIV), and protein kinase inhibition (116-120). Among these 
marine sesquiterpenoids, avarol and avarone, isolated from the Red Sea sponge Dysidea 
cinerea have been shown to inhibit HIV reverse transcriptase (RT) with respect to its 
natural substrate (dNTP) (121). However, their anti-HIV activity was determined in 
vitro using cell cultures, so it is not yet known whether the anti-HIV activity was a 
result of inhibition of HIV-1 RT (48). Avarol and avarone also exhibit strong antican- 
cer and antibacterial activities (122). Another quinine sesquiterpene, ilimaquinone, iso- 
lated from the Red Sea sponge Smenospongia sp., also has been reported to inhibit the 
RNase activity of the RT from human HIV type I at concentrations of 5-10 ug/mL, 
while being less potent against RNA-dependent DNA polymerase and DNA-depen- 
dent DNA polymerase (123). Ilimaquinone also exhibits antimitotic and anti-inflam- 
matory activities, promotes a reversible vesiculation of the Golgi apparatus, and 
interferes with intracellular protein trafficking (124). Bolinaquinone, a sesquiterpene 
hydroxyl quinine, has recently been isolated from the Philippine Dysidea sponge (117). 
It exhibits activity against the human colon tumor cell line HCT1 16 with an IC 50 value 
of 1 .9 ug/mL, mild inhibition of Bacillus subtilis at 80 ug/disk, remarkable inhibition 
of phospholipase A2, and anti-inflammatory activity (117-120). 

Illudins are a family of natural toxic sesquiterpene compounds with anti-tumor 
activity, isolated from the basidiomycete Omphalotus illudens (O. olearius and 
Clitocybe illudens). These compounds are believed to be responsible for the poisoning 
that occurs when Omphalutus is mistaken for an edible mushroom (125). Illudins S and 
M are extremely cytotoxic and exhibit antitumor activity (126,127). Illudins are effec- 
tive against various types of tumor cells at picomolar to nanomolar concentrations. A 
variety of multidrug-resistant tumor cell lines remain sensitive to the illudins (128). 
Irofulven (hydroxymethylacylfulvene), a derivative of illudin S, has been extensively 
investigated and is currently in phase II clinical trials. In particular, irofulven exhibits 
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efficacy against pancreatic carcinoma, a malignancy that is resistant to all other forms 
of chemotherapy. Irofulven rapidly enters cancer cells, where it binds to cellular mac- 
romolecules and inhibits DNA synthesis (129,130). The most unique aspect of 
irofulven' s anticancer activity seems to be its ability to act as a selective inducer of 
apoptosis in human cancer cell lines, and, in contrast to conventional antitumor agents, 
this activity of irofulven is effective against tumor cell lines regardless of their p53 or 
p21 expression (131,132). In addition, the DNA lesion induced by illudins and irofulven 
is largely ignored by global repair pathways. Therefore, the irofulven and other illudins 
are considered a new and promising class of tumor-therapeutic agents (133). 

3.3. Diterpenes 

The diterpenes represent a large group of terpenoids with a wide range of biological 
activities, isolated from a variety of organisms (134-146). One of the simplest and 
most important acyclic diterpenes is phytol (Fig. 7), a reduced form of geranylgeraniol. 
This terpenoid is perhaps the most studied biomarker of those found in aquatic envi- 
ronments, and it is a side chain of chlorophyll-a (12,147). (E)-phytol, isolated from 
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Lucas volkensii, exhibits significant antituberculosis activity, with a MIC of 2 ug/mL 
(97). Among the cyclic diterpenes, taxines isolated from the common yew (Taxus 
baccata; Taxaceae) represent an important group of compounds whose structure is 
based on the taxadiene skeleton. Taxines have been well studied because of their anti- 
cancer activity (54). 

Over 100 different taxanes have been characterized from various Taxus species. 
Paclitaxel (Taxol®) is a member of this group and possesses a four-membered oxetane 
ring and a complex ester side chain, both of which are essential for anticancer activity. 
The biosynthesis of Taxol involves complicated cyclizations and modifications (Fig. 8). 
Like epothilones and several other anticancer agents, paclitaxel is able to interact with 
tubulin (or microtubues) and inhibit cell proliferation by acting on the mitotic spindle 
through inhibition of microtubule polymerization or microtubule stabilization 
(37,38,42,63,148-150). Paclitaxel is currently used to treat ovarian, lung, and breast 
cancers, head and neck carcinoma, and melanoma. It has been hailed as the "perhaps 
most important addition to the chemotherapeutic armamentarium against cancer over 
the past several decades"(75i ). 

The development of paclitaxel into an anticancer drug spans several decades. In 
1966, a crude extract of bark from the Pacific yew was demonstrated to have a broad 
range of antitumor activities against several tumor cell lines. In 1971, paclitaxel was 
identified as the active constituent of the crude extract and its structure determined 
(152). Paclitaxel was approved by the Food and Drug Administration (FDA) for the 
treatment of advanced ovarian cancer in 1992 and of breast cancer in 1994. Paclitaxel 
inhibits microtubule depolymerization, promotes the formation of unusually stable 
microtubles, and thereby disrupts the dynamic reorganization of the microtubule net- 
work required for mitosis and cell proliferation, and causes cellular arrest in the G 2 /M 
phase of the cell cycle (153,154). The protracted arrest of the cell cycle during the 
mitotic phase is considered to be an important mechanism of paclitaxel-induced cyto- 
toxicity. Nevertheless, the precise mechanisms of cytotoxicity of paclitaxel against 
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cancer cells is not entirely clear (150). In addition, the low aqueous solubility and the 
development of clinical drug resistance are two problems associated with paclitaxel. 
Furthermore, as for many other anticancer drugs, paclitaxel has several side effects, 
including neutropenia, peripheral neuropathy, alopecia, and hypersensitivity reactions 
(155). 

Eleutherobin is another microtubule-stabilizing diterpenoid originally isolated from 
a marine soft coral species of the genus Eleutherobia (5). It was shown to possess 
activity as an inhibitor of microtubule depolymerization with a mean cytotoxicity 
greater than those of taxol or the epothilones (156). Eleutherobin belongs to the 
eleutheside family of marine diterpenoid and is extracted in extremely low yields (0.01- 
0.02% of the dry weight of the rare alcyonacean Eleutherobia sp.). Several elegant 
total syntheses have been reported (157-160). However, neither chemical syntheses 
nor the original source have provided sufficient quantities of eleutherobin to permit 
full in vivo evaluation, and this has thwarted its further development (161). Recently, 
Erythropodium caribaeorum, a relatively abundant Caribbean gorgonian, has been 
found to be a good source of eleuthesides (162,163) and can provide sufficient 
eleutherobin for preliminary animal studies and chemical transformation to new ana- 
logs (164). Sarcodictyin A and B, analogs of eleutherobin, are marine diterpenoids 
isolated from the Mediterranean stoloniferan S. roseum (6, 7) and then from the South 
African soft coral E. aurea along with two glycosylated congeners, eleuthosides A and 
B (165). Sarcodictyin A and B demonstrated very low resistance factors against the 
P-glycoprotein-overexpressing human cancer cell line (166), and their intrinsic 
antiproliferative activity against drug-sensitive cells seems significantly lower than 
those of eleutherobin and Taxol assayed in vitro (38). 

Studies involving combinatorial libraries and natural analogs of eleutherobin indi- 
cate that the loss of the sugar moiety dramatically changes the potency of eleutherobin 
against cancer, whereas modification of C-8 side chain has little effect on its potency 
(167,168). However, the C-8 side chain is essential for the cytotoxicity of the 
sarcodyctyins; and both imidazole nitrogens are also required (160). 
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Extracts of the Gorgonian Pseudopterogorgia elisabethae show anti-inflammatory 
activity and are currently used as an ingredient in cosmetic skin-care products (70). 
The active constituents in the extracts have been identified as diterpene glycosidesm 
(pseudopterosins), which have analgestic properties and are used for promoting wound 
healing and as inhibitors of PLA 2 (169,170). Methopterosin (OAS100), a semisyn- 
thetic product of pseudopterosin, also has anti-inflammatory activity and is in clinical 
development for the promotion of wound healing (55,70). Erogorgiaene (Fig. 9), a 
serrulatane diterpene (also known as biflorane), isolated from the West Indian gorgon- 
ian P. elisabethae, induces 96% growth inhibition for M. tuberculosis H37V at a concen- 
tration of 12.5 ug/mL (171). The benzoxazole diterpene alkaloid pseudopteroxazole, 
isolated from the same gorgonian, shows 97% growth inhibition for M. tuberculosis 
H37Rv at a concentration of 12.5 ug/mL without substantial toxic effects (172). 

The bromoditerpene sphaerococcenol A, isolated from the red alga Sphaerococcus 
coronopifolius collected along the Atlantic coast of Morocco, has antimalarial activity 
against the chloroquine-resistant Plasmodium falciparum FCB 1 strain, with an IC 50 
of 1 fiM (173). Two other bromoditerpenes, bromosphaerone and 12S-hydroxy- 
bromosphaerodiol, isolated from the same species, show strong antibacterial activity 
against Staphylococcus aureus, with MICs of 0.104 and 0.146 \\.M, respectively 
(173,174). 

Diisocyanoadociane, a tetracyclic diterpene with an isocycloamphilectane skeleton, 
isolated from the sponge Cymbastela hooperi, has been reported to have significant 
antimalarial activity in vitro against two clones of the malaria parasite P. falciparum 
(175). The tetracyclic diterpene demonstrated significant antiplasmodial activity, with 
IC 50 values of 4.7 ng/mL and 4.3 ng/mL and selectivity indices (SI) 1000 and 1100 
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against the two clones of P. falciparum. Another tricyclic diterpene, 20-isocyanto-7- 
isocyanoisocycloamphilectane, isolated from the same sponge, also shows the same 
level of antiplasmodial activity and selectivity. In addition, the potency and selectivity 
of the two tetracyclic diterpenes from the sponge C. hooperi rivals the in vitro results 
obtained with the currently prescribed antimalarial drugs (Artemisinin and chloroquine) 
(173). 

Finally, solenolide A, a diterpene lactone, isolated from a new Indopacific gorgon- 
ian species of the genus Solenop odium, has been reported to inhibit rhinovirus with an 
IC 50 value of 0.39 ug/mL. This diterpene also shows inhibitory activity against poliovi- 
rus III, herpesvirus, and the Ann Arbor and Maryland viruses (176). Spongiadiol, a 
tetracyclic furanoditerpene isolated from deep-water Spongia sp., shows inhibitory activ- 
ity against HSV1 at a concentration of 0.5 ug/disk (177,178). 

3.4. Sesterterpenes 

Sesterterpenes are the least common group of terpenoids. They are primarily iso- 
lated from fungi and marine organisms, and encompass relatively few structural types 
(12,13). Although many examples of natural sesterterpenes are known, studies of their 
biosynthesis are rare (12). 

Many sesterterpenes inhibit the activity of the human secreted type IIA phospholi- 
pase A 2 (PLA 2 ). LPA 2 is involved in the pathogenesis of a variety of inflammatory 
diseases via the production of arachidonic acid, the precursor of prostaglandins and 
leukotrienes. Therefore, secreted PLA 2 has been considered to be a primary target for 
the development of anti-inflammatory drugs (119). The well-known PLA 2 inhibitors 
among sponge sesterterpenes are the scalaranes, which were named after scalaradial 
(Fig. 10), isolated from Cacospongia mollior (179,180). Marine sesterterpenes con- 
taining the y-hydroxybutenolide moiety have been studied for their potent anti-inflam- 
matory activity. Manoalide, which is the first marine PLA 2 inhibitor of sesterterpene 
and a reference compound for this class of natural products, is a potent analgesic and 
anti-inflammatory agent isolated from the pacific sponge Luffariella variabilis (181). 
The anti-inflammatory activity of manoalide is ascribed to its irreversible inhibition of 
PLA 2 (181-183), resulting in inhibition of the formation of pro-inflammatory media- 
tors such as leukotrienes and prostaglandins (184,185). This compound was formerly 
in phase I clinical trials, but its development was later discontinued (55). Thereafter, 
many other related molecules have been isolated, such as seco-manoalide, luffariellolide, 
luffariellins, luffolide, the cacospongionolides, and the petrosaspongiolides M-R 
(181,186-192), all of which are irreversible inhibitors of PLA 2 . Among these com- 
pounds, petrosaspongiolide M has been investigated for its pharmacological activity in 
vivo and in vitro. This compound has significant inhibitory activity against PLA 2 (IC 50 
of 0.6 \yM) (186) and reduces the level of prostaglandin E2, tumor necrosis factor a, 
and leucotriene B4 in a dose-dependent manner (193-195). Moreover, it has no signifi- 
cant side effects. Consequently, there is considerable interest in the development of 
this molecule and its analogs for the treatment of acute and/or chronic inflammation 
( 194,196). 

The novel sesterterpene salmahyrtisol A (Fig. 11) and three other new scalaranes- 
type sesterterpenes (3 -acetyl sesterstatin 1, 19-acetyl sesterstatin 3, and salmahyrtisol) 
isolated from the Red Sea sponge Hyrtios erecta, show significant cytotoxicity 
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against murine leukemia, human lung carcinoma, and human colon carcinoma (197). 
Halorosellinic acid, an ophiobolane sesterterpene isolated from the marine fungus 
Halorosellinia oceanica, exhibits antimalarial activity, with an IC 50 of 13 ug/mL, and 
weak antimycobacterial activity, with a MIC of 200 ug/mL (198). Mangicols A-G are 
a group of sesterterpenes possessing novel spirotricyclic natural compounds, isolated 
from the marine fungus tentatively identified as Fusarium heterosporum (199). Among 
these compounds, mangicols A and B show significant anti-inflammatory activity in 
phorbol myristate acetate-induced mouse ear edema assay. 

4. Chemical Synthesis in Terpenoid Drug Production and Development 

During the first part of the 20th century, total synthesis played a central role in iden- 
tification and structure confirmation of the active principle in a crude extract from 
natural sources (200,201). However, with the development of modern analytical and 
purification techniques and spectroscopic methods, structures of most natural products 
can now be determined unambiguously. Nevertheless, total synthesis is still the ulti- 
mate proof for the structure of complex natural products, especially the absolute stere- 
ochemistry or remote relative stereochemistry. Many terpenoids with promising 
biological activity are structurally too complex to be readily synthesized in a cost- 
effective way via total synthesis from commercially available materials. Semisynthesis, 
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on the other hand, has played a major role in the development and production of terpe- 
noid-derived drugs. 

The isolation of paclitaxel from nature did not produce enough material for clinical 
trials (202). In addition, killing yew trees for compound isolation was a big environmen- 
tal concern. The four-step semisynthesis (Fig. 12) of paclitaxel fromlO-deacetylbaccatin 
III greatly facilitated the development of Taxol. 10-deacetylbaccatin III was found in 
the needles of the common European yew tree T. beccata as well as a yew tree species 
found in India. By harvesting and extracting the needles, baccatin III or 10-deactyl 
baccatin III can be provided in large quantities as precursors of paclitaxel without sub- 
stantially injuring the tree populations. In addition to paclitaxel, analogs such as 
docetaxel (Taxotere®) were synthesized in sufficient yield by semisynthesis. Com- 
pared with paclitaxel, semisynthetic docetaxel has improved water solubility and is 
being used clinically against breast and ovarian cancer (151). Six groups completed 
total synthesis of paclitaxel between 1994 and 2002 (203-216). However, none of their 
methods are comparable to semi-synthesis in terms of the cost for large-scale produc- 
tion. 

Artemisinin (Qinghaosu) has also been produced by semisynthesis. Some pheno- 
types of A. annula have been found to produce as much as 1% Artemisinin, but the 
yield is normally very much less, typically 0.05-0.2%. The more abundant sesquiter- 
pene in the plant is artemisinic acid (qinghao acid, typically 0.2-0.8%). Fortunately, 
artemisinic acid can be converted chemically into Artemisinin by a relatively simple 
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and efficient process (Fig. 13) (217-219). At the same time, Artemisinin can be re- 
duced to the lactol (hemiacetal), and this can be used for the semi-synthesis of a range 
of analogs, of which artemether, arteether, and the water-soluble sodium salts of 
artelinic acid and artesunic acid are promising second-generation antimalarial drugs 
(220). The total synthesis of Artemisinin was completed in 1983, and the peroxy bridge 
was confirmed to be its unique feature conferring antimalarial activity (221). 

Although total synthesis plays a less important role in production of terpenoid 
drugs, it has been shown to be an important tool in the development of compounds 
derived from terpenoids and other natural compounds. First, total synthesis is an 
essential tool for studying structure-activity relationships (SAR). For example, the 
binding of manoalide to PLA 2 was demonstrated by using the two-masked aldehyde 
present in the molecule. Extensive SAR studies using chemical synthesis of various 
analogs have revealed that the minimum structural requirement for activity is the 
presence in the inhibitor of functional groups able to bind to the amino groups of 
PLA 2 lysine residues. Many manoalide analogs have thus been synthesized, and many 
of them share PLA 2 inhibitory properties (222). A similar approach has also revealed 
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the active moieties of paclitaxel, eleutherobin, sarcodictyin A, and Artemisinin 
(106,160,163, 167,168,223). 

Second, it is often difficult to correlate the relative stereochemistry of remote chiral 
centers by spectroscopic methods. Total synthesis is the ultimate tool for solving this 
problem. For example, the chemical structure of eleutherobin was elucidated by exten- 
sive two-dimensional nuclear magnetic resonance spectroscopy and mass spectrom- 
etry (5). L-arabinose, the sugar moiety of eleutherobin, appeared to be favored in nature. 
This was proved to be wrong by total synthesis. The relative stereochemistry of the 
diterpenoid 4,7-oxaeunicellane skeleton and the arabinose of eleutherobin were assigned 
by the Danishefsky and Nicolaou groups through total synthesis (224-227). It is remark- 
able that they could unambiguously identify the sugar unit of eleutherobin as D-arabi- 
nose despite the fact that L-arabinose is the natural abundant enantiomer. 

Finally, libraries of structurally diverse natural-product-like molecules have con- 
tributed to the understanding of biological processes in small molecule-based system- 
atic approaches (228). Efficient access to diversity can be achieved in diversity -oriented 
synthesis using pairs of complexity-generating reactions, where the product of one is 
the substrate for another (229). Diversity can also be achieved by chemical modifica- 
tion of natural products. For example, the potential of macrocyclic diterpenoids to 
afford natural-product-like polycyclic compounds was demonstrated by the conver- 
sion of two lathyrane Euphorbia factors into a series of densely functionalized 
diterpenoids of unnatural skeletal type (Fig. 14) (230). 

5. The Sustainable Production of Terpenoids 

Like many other natural products, terpenoids are usually extracted from the source 
organism in extremely low yields. Producing an adequate and sustainable supply of 
these compounds has been one of the major challenges for developing terrestrial and 
marine organism-derived natural terpenoid compounds into clinically useful entities 
(164). Harvesting source organisms from the environment for the target terpenoid is 
not a feasible strategy because of environmental and ecological concerns, especially 
for terpenoids derived from marine environments. For example, eleutherobin, a struc- 
turally complex diterpene glycoside, constitutes 0.01-0.02% of the dry weight of the 
rare alcyonacean coral Eleutherobin sp. (5). The limited supply of paclitaxel from its 
primary source once hampered the development of paclitaxol into a clinically used 
drug (150). The limited supply of eleutherobin has restricted the evaluation of this 
promising marine diterpene (162). Consequently, a serious obstacle to the commercial 
development of most promising terpenoids is their availability from the natural envi- 
ronment. To provide enough material for preclinical evaluation and clinical trials, sev- 
eral approaches have been used to produce terpenoids with promising biological 
activities, and their implementation should greatly facilitate the development of novel 
terpenoid-related drugs. 

5. 1. Metabolic Engineering 

The redirection of metabolic pathways for enhanced production of existing natural 
products or for production of "unnatural" natural products has been an active area of 
research. Metabolic engineering, which integrates engineering design and systematic 
and quantitative analysis of pathways using molecular biology, modern analytical tech- 



214 



OCHO 




Ac6 OH \ 

OCHO 




PhAcO H OAc^OCHO 




Wang, Tang, and Bidigare 



PhAc ° Ac6 V 
euphorbia factor L1 

/ + 




PhAcO 



OAc 



PhAcO 



OAc 




OMe 



Fig. 14. Diversity via natural-product derivatization. 



niques, genomic approaches, and "-omics" technologies into production of natural 
products, will provide alternative approaches to produce terpenoids with pharmaceuti- 
cal value. Indeed, enhancing monoterpene yields and changing the metabolite compo- 
sition of essential oils in plants has been shown to be feasible. Using the transgenic 
plants to increase the production of monoterpenes for scent and flavor has been demon- 
strated in flowers (231,232) and fruits (233). Moreover, the pathway engineering 
approach has been used to successfully produce several interesting sesquiterpenes, 
including amorphadiene, the precursor of the antimalarial drug artemisinin, in E. coli 
(234,235). As a powerful tool for the production of both natural products and the library 
of their analogs, the engineered fermentable microbial host will provide renewable and 
sustainable resources to generate terpenoids with interesting biological activities. 

5.2. Cell Culture and Others 

Advances in cell culture have also provided a promising means for the production of 
valuable secondary metabolites from plants. Production of paclitaxel and other taxanes 
using cell-culture techniques has been well established. The level of taxoid production 
in bioreactors has been reported to reach 612 mg/L in T. chinensis cell cultures (236). 
Strategies for enhancing paclitaxel and its precursors have been reviewed extensively 
by Zhong (237). At the same time, the production of artemisinin has also been pursued 
in callus, cell suspension, and shoot and hairy root cultures of A. annua (238-242). 
Undifferentiated callus and cell suspension cultures produce extremely low yields of 
artemisinin. However, differentiated shoot cultures and hairy roots show promising 
potential for artemisinin production (237). Compared with the paclitaxel production 
level, the production yield of artemisinin in cell culture is still not significant enough to 
be used for commercial purposes. 

The bioprocess engineering of marine macroalgae for the production of halogenated 
monoterpenes has been carried out in small-scale tissue-culture bioreactors (243- 
245). In the established system, the photosynthetic tissue culture was developed for 
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the marine red algae Ochtodes secondiramea and P. hornemannii using callus-induc- 
tion and shoot-regeneration techniques. Rates of monoterpene production have been 
shown to be affected by temperature, light intensity, and nitrate concentration in the 
growth medium (246,247). Metabolic flux analysis indicated that the halogenated 
monoterpene production was not limited by its precursor (myrcene) and that chlorina- 
tion is the target for increasing the production (248). 

Recently, aquaculture systems designed and engineered for the production of large 
quantities of biomass of two species of marine invertebrates (Bugula neritina and 
Ecteinascidia turbinate) have been established in order to harvest their natural chemi- 
cal constituents (249). Aquaculture will also provide an alternative approach for pro- 
ducing valuable terpenoids from marine invertebrates in a reliable, renewable, and 
cost-effective way. However, additional research is still needed in order to optimize 
controlled environment culture systems and to explore the feasibility of such systems 
for other marine invertebrates. Finally, the Caribbean gorgonian E. caribaeorum has 
been cultured in shallow running seawater tanks located in a greenhouse under ambient 
sunlight illumination (164). The cultured E. caribaeorum produce eleutherobin and the 
briarane diterpenoids erythrolides A and B in yields comparable to those reported from 
wild-harvest reef animals. Therefore, the aquaculture could enhance large-scale marine 
terpenoid production in the near future. 

6. Concluding Remarks 

Terpenoid-derived drugs have contributed significantly to human disease therapy 
and prevention. Some terpenoid drugs have provided tremendous benefits for patients 
and for the pharmaceutical industry. Artemisinin and its derivatives comprise a multi- 
million-dollar market worldwide. Taxol alone is estimated to have annual sales of over 
$1.8 billion. Terpenoids indisputably continue to be important compounds for drug 
discovery. In last two decades in particular, many terpenoids with promising biological 
activities have been isolated from diverse marine environments. These marine terpe- 
noids exhibit an impressive array of novel structural motifs, many of which are consid- 
ered to be derived from biosynthetic pathways that are exclusive to marine organisms. 
Moreover, most of these marine terpenoids possess remarkable biological activities 
whose potential benefits extend beyond the marine ecosystem and embody the devel- 
opment of new antifungal, anticancer, anti-inflammatory, and antiviral drugs (250). 
Although marine natural-product bioprospecting has begun only relatively recently, it 
has already yielded over a thousand novel molecules. Marine natural-compound 
bioprospecting will continue to provide more promising terpenoids and other natural 
products for drug development. In addition, marine biodiversity is estimated to be much 
greater than that on land (55). Many marine animals, plants, fungi, and algae are rich 
sources of novel terpenoids. With the continuing exploration of the oceans and afford- 
able technology for the exploration of the deep oceans, more novel terpenoids with 
promising biological activities are expected to be discovered from marine environ- 
ments in the near future. 

In addition, chemical synthesis (e.g., derivatization of natural products) and combi- 
natorial synthesis of natural product analogs can also play a significant role in provid- 
ing SAR data for a particular target. Accurate knowledge of the structural features 
required for activity in each compound class will give the predictive power of 
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pharmacophore models. For example, several pharmacophore models have attempted 
to reconcile the SAR data for taxoids and other compound types in order to generate a 
sufficiently detailed understanding of tubulin-binding requirements. Information from 
this approach will allow rational design of new classes of microtubule-stabilizing drugs. 
The terpenoids encompass a wealth of significantly diverse compounds, providing 
chemists with great opportunities to synthesize not only new, valuable terpenoid drugs, 
but also novel terpenoid compounds that can be used as tools for understanding bio- 
chemical pathways. 

Finally, along with the information derived from the human genome and metabolic 
pathways, high-throughput technologies such as proteomics, transcriptomics, and 
metabolomics will greatly shorten the time required for assaying natural compounds in 
vivo and in vitro. In parallel, new analytical instruments and synthetic approaches will 
further facilitate both the identification of terpenoids from natural sources and better 
understanding the SAR. In particular, marine biodiversity is far beyond what we have 
dreamed. Isolation and identification of marine terpenoids should draw more attention 
from drug discovery and development. Engineered new assays and high-throughput 
tools should facilitate identification and isolation of new terpenoids. There is no doubt 
that more terpenoid-based clinical drugs will become available and play a more signifi- 
cant role in human disease therapy. 
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Challenges and Opportunities in the Chinese Herbal Drug 
Industry 



Wei Jia and Lixin Zhang 



Summary 

Tremendous achievements have been made in Western medicine in the past that provide fast 
relief of symptoms at the disease sites, particularly under critical conditions. However, some are either 
ineffective or produce undesirable adverse effects, or are too costly in some complex diseases, espe- 
cially chronic diseases. On the other hand, traditional medicines strive to focus on the balance of the 
body in a holistic manner. Traditional medicines are increasing popular in the Western world, as 
reflected in the name changes from "alternative medicines" to "complementary medicines" or even 
"integrative medicines." Traditional Chinese medicines (TCM) have been used in China for more 
than 2000 yr and have always followed the philosophy and principle of restoration, i.e., the yin and 
yang (balance of the body). As an important part of the pharmaceutical sector, the Chinese herbal drug 
industry has made rapid progress over the past decades. This chapter provides a review of its current 
status, including challenges and opportunities, specifically with regard to modernization and global- 
ization of TCM. 

Key Words: Complementary and alternative medicine (CAM); integration; traditional Chi- 
nese medicine (TCM); modernization; globalization; preventive therapy. 

1. Introduction 

Instead of the hit-or-miss technology of the past, current biological research and 
much of drug discovery is being driven by the search for new molecules targeting 
disease-relevant proteins (1-3). In this approach, a specific protein is studied in vitro, 
in cells and in whole organisms, and evaluated as a drug target for a specific therapeu- 
tic indication (3-5). The historical paradigm "one drug, one target" has resulted in the 
identification of many effective chemical molecules that affect specific proteins, pro- 
viding valuable reagents for both biology and medicine. One example is the recently 
US Food and Drug Administration (FDA)-approved drug Avastin, a recombinant 
humanized antibody designed to bind to and inhibit vascular endothelial growth fac- 
tor (VEGF) in tumor angiogenesis (6). Benefiting from the advancement of biology 
and chemistry, the world market of drug products has increased tremendously over the 
past decade, from $180 billion USD in 1990 to $430 billion in 2001 (7). IMS Health, a 
well-known information company of the pharmaceutical industry, anticipated that the 
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Fig. 1. Annual sales of the world drug industry. 



annual sales of drug products would reach $587.9 billion USD by the year 2005, with 
an average growth rate of 7.8%, much higher than the average growth rate of the global 
economy (2.4%) (Fig. 1) (7). 

On the other hand, the expensive new technologies, innovation, and intellectual prop- 
erties, combining factors such as regulatory issues and high failure rate, really cause 
health care costs to skyrocket. It costs about $1 .7 billion to bring a new chemical entity 
(NCE) drug to market, a 55% increase over the average cost during 1995 to 2000 (8- 
12). How can we increase the efficiency of drug development while reducing the fail- 
ure rate of compounds in clinical trials? Pharmaceutical companies are increasingly 
responding with mergers, partnerships, and intensified promotion. They also shift 
focus to diseases of richer and older people, and away from antimicrobials, vaccines, 
and the like (13). Ironically, even though the investment on research and development 
is steadily increasing, the pharmaceutical discovery pipeline has declined substantially 
over the past decade. One of the major limiting factors is the prevalent "one drug, one 
target" dogma in the biotechnology and pharmaceutical industries. Pharmaceuticals 
have been traditionally designed to target individual factors in a disease system but 
have limited success in some complex diseases, especially chronic diseases. The rea- 
son is that, in real physiology, diseases are multigenic. Therefore, to cure diseases, 
multiple stages along the disease pathway may need to be manipulated simultaneously 
for an effective treatment (1). Systems and integrative biology have revealed that 
human cells and tissues are composed of complex, networking systems with redun- 
dant, convergent and divergent signaling pathways (14-21). On the other hand, drugs 
are targeting human beings, and each individual is different; this leads to the popularity 
of pharmacogenomics and personalized medicine (22). 

Traditional medicines focus on the balance of the body in a holistic manner and 
consider the difference between individuals. Traditional medicine is increasingly popu- 
lar in the Western world, as reflected in the name changes from "alternative medi- 
cines" to "complementary medicines" or even "integrative medicines" (23-29,119). 
Name changes reflect a dissatisfaction with conventional Western medicine as well as 
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a cultural rebellion against the biomedical community, and a process to bring tradi- 
tional medicine into alliance with and integrate it into a pivotal part of mainstream 
health care. As many as 42% of individuals in the United States are adopting comple- 
mentary and alternative medicine (CAM) approaches to help meet their personal health 
needs (30,31). The popularity of CAM has spread to the whole world (32,33). There 
probably will be a paradigm shift to bridge conventional Western medicines with TCM 
and enable the drug discovery pipeline to become more productive with respect to safer 
and better drugs. 

In 1998, responding to public demand for better guidance regarding the myriad of 
CAM options, the United States Congress authorized the creation of the National Cen- 
ter for Complementary and Alternative Medicine (NCCAM) at the National Institutes 
of Health (NIH). The establishment of the center represented an expansion in scope 
and authority of the Office of Alternative Medicine (OAM), which was first estab- 
lished in 1992 (34). Patients who choose CAM approaches are seeking ways to improve 
their health and well-being, and to relieve symptoms associated with chronic or even 
terminal illnesses, or with the side effects of conventional treatments. The effective- 
ness, scientific rationale, side effects, and cost-effectiveness of different kinds of CAM 
(including dietary modification; supplementation with antioxidant vitamins, soy, herbs; 
acupuncture; massage; exercise; psychological and mind-body interventions) have 
been studied and compared in parallel on cancer patients (35). TCM emphasizes the 
proper balance or disturbances of Qi — or vital energy — in health and disease, respec- 
tively. TCM consists of techniques and methods such as acupuncture; microbial, plant, 
and animal products; physical exercises and calisthenics with, or without, meditation; 
Qi Gong; massage; and other forms. In this chapter, we will focus on the status of 
herbal remedies (typical natural products) either as single-chemical entity drugs or 
complexed botanical supplements. 

More than 60% of small-molecule drugs are either natural products or their deriva- 
tives (36,37). The total number of natural products produced by plants has been esti- 
mated to be over 500,000 (38). About 160,000 natural products have been identified 
(39), a number growing by 10,000 per year (40^43). Many important drugs are derived 
directly or indirectly from the active ingredients of herbal remedies, such as the prede- 
cessor of aspirin from willow tree bark (44), reserpine from the herb Rauwolfia 
serpentina, Taxol from the Pacific yew tree, Qinghaosu (artemisinin and senna) from 
Artemisia annua and Cassia angustifolia, quinine from cinchona bark, digitalis from 
foxglove leaf, morphine from the poppy herb, and vincristine from rosy periwinkle 
(45-49). 

In the past decade, the American government issued or proposed several new regu- 
lations in favor of developing Chinese medical herbal products in the United States. 
Following the Dietary Supplement Health and Education Act (DSHBA), passed by the 
US Congress in 1994 (86), the FDA issued the Final Rule of Dietary Supplements on 
February 7, 2000. In accordance with current FDA regulations, many TCM products 
may be promoted and marketed in the United States as dietary supplements. The FDA 
also drafted "Guidance for Industry: Botanical Drug Products," which encourages 
medical societies and pharmaceutical companies to develop botanical materials for 
curing human diseases (87). These developments provide a great opportunity to the 
TCM pharmaceutical industry. 



232 Jia and Zhang 

In Europe, the same substance may be considered a dietary supplement in one coun- 
try while being sold as an over-the-counter (OTC) drug in another country (50,51). 
This situation will change if the European Union develops a uniform set of laws to be 
followed by its member states. There exists a long tradition in Asian countries dating 
back thousands of years of identifying and using herbal ingredients to successfully 
treat various diseases. Usage rates range from 29% to 53% among various patient popu- 
lations in Korea (26). It is very popular in China, and China's herbal drug industry has 
contributed significantly to this rich reservoir of both herbal drugs and dietary supple- 
ments (53-61). 



2. The Development of the Chinese Herbal Drug Industry 

TCM has been used in Chinese medical practice for more than 2000 years. Based on 
available literature, TCM products were safe and effective for the treatment of many 
human diseases before Western medicines were introduced and marketed in China. 
TCM has been an integral part of China's health care system, along with conventional 
Western medicine (59-61). Patients generally enjoy the benefit of combining the power 
of traditional and conventional Western medicine. 

Although herbal medicine has a long history and pharmacological foundation in 
China, a TCM industry was only established a century ago by adopting some modern 
technologies. A system for the production and circulation of herbal drug products has 
been formed in China with the production of Chinese medicinal materials as its foun- 
dation, Chinese herbal manufacturing industry as its main body, and a Chinese herbal 
drug-marketing network as its linkage. 

China is rich in medicinal resources, with more than 11,100 species of medicinal 
herbs and 2000 prescription recipes well documented. There are more than 600 planta- 
tion bases for the production of TCM raw materials. About 5 million mus (1 mu = 
0.165 acre) of land are used to produce Chinese medicinal materials every year, with an 
output of about 400,000 tons. Currently, there are more than 1000 herbal pharmaceuti- 
cal factories in China, producing more than 8000 herbal drug products in more than 40 
different kinds of dosage forms. There are more than 30,000 wholesale and retail shops 
for herbal medicines in China (62-64). With the back-to-nature trend and the fact that 
China recently became a member of the World Trade Organization (WTO), an increase 
in the use of Chinese herbal medicine is expected, with a projected $400 billion world- 
wide market by 2010 (59). 

The seemingly overwhelmed herbal market will facilitate new herbal product devel- 
opment. Since the late 1970s, the Chinese pharmaceutical industry has achieved a very 
high growth rate, averaging 17.7% annually, a number much higher than the average 
growth rate of its national GDP (8%). The consumer market of pharmaceutical prod- 
ucts increased from annual sales of $1.83 billion USD in 1990 to $18 billion USD in 
2002, nearly a 10-fold increase, and has become one of the fastest-growing sectors in 
China (Fig. 2). At the end of 1999, 28 herbal drug products from 107 Chinese manufac- 
turers reached individual annual sales beyond $12 million USD (65). 

The herbal drug industry is more profitable than most other industries in China. The 
role of government in managing the TCM industry has changed fundamentally as a 
result of China's economic reform and opening policies, especially the in-depth reform 
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of the social security, health care, and medical insurance systems. According to the 
latest data from the Chinese National Economy and Trading Commission, the annual 
growth rate of the TCM industry has averaged 20%, with a profit rate reaching 24% 
annually in the past decade (62,63). From the latest survey of Chinese industries by the 
National Bureau of Statistics, the TCM industry is among the best (as judged by eight 
economic indices), in comparison with the 41 other industries, including petroleum 
and rubber. Its profit margin ranked number two, next to the tobacco industry. The 
Strategic Development Center of the State Council in China has recently completed a 
project entitled "The Strategic Development of Traditional Chinese Medicine," under 
the ninth Five-Year Strategic Plan, in which the general competitiveness of about 10 
industries, including herbal drugs, food manufacturing, pharmaceutical manufactur- 
ing, and electronic and communication equipment, were evaluated based on 12 indices. 
From the report, the TCM industry was ranked number four and was believed to have 
great potential, with strong development capability, creativity, and economy-driving 
impact (66). Government funding of TCM research and development has been and will 
be continually increasing. 

3. Modernization of TCM 

Medicines are products that are claimed to treat, cure, mitigate, or prevent a disease, 
and are regulated by national authorities. Since the establishment of the US FDA in 
1938, all drugs have had to be proven safe for their intended use to gain FDA premarket 
approval (unless they had been "grandfathered" as old drugs) (84). Since 1962, approval 
has also required that efficacy should be shown in adequate and well-controlled stud- 
ies. Furthermore, all drug manufacturers must follow current good manufacturing prac- 
tices (CGMPs) to assure quality and standardization of their drugs, and must list their 
facilities and products with the FDA (85-88). The Chinese government established 
their own standard for TCM drugs and dietary supplements by comparing different 
criteria from the United States, European Union, and Japanese systems. 

Systematic government agencies are also established to monitor and regulate affairs 
in the health care and pharmaceutical industry: the State Administration of Industry & 
Commerce (SAIC) is in charge of ethical promotion and business transition; the State 
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Table 1 

Resources of Chinese Materia Medica 



Resource 


Family 


Genus 


Species 


Percentage (%) 


Medicinal plants 
Medicinal animals 
Medicinal mines 

Total 


383 
395 


2309 
862 


11,146 

1581 

80 

12,807 


87.0 
12.3 
0.63 



Development and Planning Commission (SDPC) controls prices; the State Food and 
Drug Administration (SFDA) focuses on drug regulatory issues; the Ministry of Labor 
and Social Security (MLSS) deals with issues of medical insurance, reimbursement, 
and co-payment; the Ministry of Health (MOH) is responsible for health service and 
hospitals. Relatively detailed documents are written on a complete system of quality 
control, supervision, and standardization of TCM, such as current good agriculture 
practice (CGAP), to protect the medicinal plant resource for sustainable development; 
CGMP for manufacturing; current good laboratory practice (CGLP) for consistent 
bookkeeping; and current good clinical practice (CGCP) for TCM development. Other 
standard operation protocols, such as current good quality control practice (CGQP) 
and current good extraction practice (CGEP) are proposed and will be implemented 
soon. 

3. 1. The Status of Medicinal Plant Resources in China 

More than 1 1 % of the world plant species are found in China, including 240 rare gen- 
era. A national survey indicated that China had 12,807 species of medicinal materials 
(67,68) (Table 1), in which 11,146 species (9933 taxonomic species and 1213 taxonomic 
units under species) are medicinal plants, including 10,687 species of seed plants, bryo- 
phytes, or pteridophytes and 459 species of algae, bacteria, fungi, or lichens (Table 2). 

As a result of over-exploitation, the reserves and production of wild medicinal plants 
gradually decreased. For example, Radix Glycyrrhiza uralensis was originally produced 
in the province of Inner Mongolia, but its reserve and production has decreased very 
quickly. Recently, the annual production of Radix Glycyrrhiza was reduced by 40% 
compared to the 1950s, and the main harvest place of this plant shifted to Xinjiang 
Province (69). The wild plant output of another common medicinal herb, Radix Astraga- 
lus membranaceus and Radix Astragalus mongolicus, was more than 2000 tons in the 
1960s, but has decreased to less than 100 tons in recent years (70). Many other medici- 
nal plant species are in danger of extinction, such as Acanthopanax senticosus; 
Atractylodes lancea; Anemarrhena asphodeloides; Asarum sieboldii; Cistanche salsa; 
Cynomorium songaricum; Dichroa febrifuga; Ephedra sinica; Gastrodia elata; 
Gentiana macrophylla; Gentiana scabra; Glycyrrhiza uralensis; Lithospermum 
erythrorhizon; Notopterygium incisum; Paris polyphylla; Phellodendron amurense; 
Pinellia ternate; Rheum officinale; Saposhnikovia divaricata; Scutellaria baicalensis; 
Stellaria gypsophiloides; Tripterygium wilfordii; Uncaria rhynchophylla; Vitex trifolia; 
and Ziziphus jujuba. 
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Table 2 

Medicinal Plant Resources in China 



Resource 


Families 


Genera 


Species 


Algae 


42 


56 


115 


Bacteria 


40 


117 


292 


Fungi 


9 


15 


52 


Lichens 


21 


33 


43 


Bryophytes 


49 


116 


456 


Pteridophytes 


222 


1972 


10,188 


Seed plants 


383 


2309 


11,146 



The Chinese government has recognized the situation and has begun to implement 
CGAP to protect the medicinal plant resources for sustainable development. Thus far, 
around 600 CGAP cultivation bases have been established nationwide, and the total 
cultivation area of medicinal plants has reached 6 million mus (71-74). The research 
institutes, herbal drug manufacturing plants, and local farmers are the three important 
and indispensable participants in this effort. The CGAP cultivation base for each drug 
material is usually built in its genuine or original production place(s) to achieve the 
geo-authenticity of the plants. Table 3 lists some significant CGAP cultivation bases. 

3.2. Status of Chinese Prepared Medicinal Herbs 

In the process of the TCM industry, there is one sector called "prepared medicinal 
herbs," situated between the cultivation of medicinal plants and extraction/manufac- 
turing of herbal drug products. The manufacturers specializing in prepared medicinal 
herbs collect, clean, cut, and sometimes process the herbs through boiling or steaming 
procedures. Although many researchers in the industry suggest that a good processing 
practice (GPP) should be put in place, much less research and process development 
work has been done to improve the processing of Chinese prepared medicinal herbs, as 
compared to CGAP in the cultivation of medicinal materials and CGMP in the produc- 
tion of herbal drug products. 

By the end of 2001, there were 1175 Chinese herbal drug factories, equivalent to 
18% of the total number of the medical and pharmaceutical manufacturing plants in 
China. Currently, there are 48 manufacturing plants of prepared medicinal herbs, 14 of 
which are operating at a deficit. Sales of prepared medicinal herbs in 2001 reached $60 
million USD, accounting for 1.08% of the total market of herbal drugs. Table 4 lists 
the top 18 large manufacturers of prepared medicinal herbs in China (61,75). 

The quality control methods and standards of Chinese prepared herbs are being 
established with modern analytical technologies, including microscopic analysis and 
chromatographic fingerprints. In addition, regulatory and market control laws are being 
strengthened to eliminate false and low-quality Chinese prepared herbs. 

To meet the growing demand for consistency in herbal material handling and quality 
control, a new form of prepared medicinal herb, the granule of herbal extracts, has 
recently been developed and marketed in the major TCM hospitals in China. The two 
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Table 3 

Some Current Good Agriculture Practice Bases of Chinese Materia 

Medica in China 



Site of bases 



Cultivated medicinal plants 



Anhui province 
Chongqing 
Gansu province 
Guangxi province 
GuiZhou province 
Hebei province 
Heilongjiang province 
Henan province 
Hubei province 
Hunan province 
Inner Mongolia 
Jiangsu province 
Jilin province 
Liaoning province 
Ningxia province 
Shandong province 
Shanghai 
Shanxi province 
Sichuan province 
Tibet province 
Yunnan province 



Paeonia suffruticosa 

Pinellia ternata 

Angelica sinensis 

Momordica grosvenorii 

Dendrobium candidum; Eucommia ulmoides; Ginkgo biloba 

Angelica dahurica; Scutellaria baicalensis 

Panax ginseng 

Rehmannia glutinosa 

Lespedeza cyrtobotrya 

Eucommia umloides 

Glycyrrhiza uralensis 

Chrysanthemum morifolium 

Panax ginseng; Panax quinquefolium 

Panax ginseng 

Lycium chinensis 

Lonicera japonica 

Crocus sativus 

Salvia miltiorrhiza 

Crocus sativus; Ligusticum chuanxiong 

Rhodiola rosea 

Dracaena draco; Panax notoginseng 



major suppliers are Guangdong E-fong Pharmaceuticals and Jiangyin Tianjiang Phar- 
maceutical Company. The prepared herbal granule is made by a modern technique of 
processing single medicinal herbs, in which herbs are boiled until a thick syrup emerges 
and then are dried by a combined spray drying and fluidized bed drying technique. 
Granules are considered to have the highest effectiveness among all the preparations, 
because they maintain the most active ingredients through such a process and retain 
potency for a long time in storage. Nearly 500 kinds of prepared herbal granules have 
been made available on the market, and many TCM practitioners and consumers prefer 
to use such materials for combination formula preparations because granules are easier 
to keep and handle, and require lower amounts per volume than liquid extracts or raw 
materials. At the same time, this new form of prepared medicinal herbs is being chal- 
lenged by many conventional herbal drug manufacturers and researchers. One of the 
major critiques is that the use of such a modern preparation in traditional combination 
formulas omitted an important step, i.e., decoction of mixed raw herbal materials, where 
synergistic chemical interactions occur to enhance activity and reduce toxicity; this 
does not comply with TCM philosophy. Additionally, the direct use of granule prepa- 
rations in hospitals is not regulated or controlled. 
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Table 4 

Top 18 Manufacturers of Chinese Prepared Medicinal Herbs in 2001 

No. Name of the manufacturing plants Sales ($ 1,000 USD) 

1 Xinjiang Tefeng Pharmaceutical Inc. 8605 

2 Shenzhen Jinchun Pharmaceutical Co. Ltd. 7321 

3 Shanghai Tong-han-chun-tang Manufacturing Plant 6343 

of Prepared Medicinal Herbs 

4 Tianjin Manufacturing Plant of Prepared Medicinal Herbs 4841 

5 Shanghai Lei-yun-shang Plant of Prepared Medicinal Herbs 471 1 

6 Shanghai Xu-chong-dao Manufacturing Plant of Prepared Medicinal Herbs 3865 

7 Shanghai Baoshan Manufacturing Plant of Prepared Medicinal Herbs 2131 

8 Shanghai Xuhui Manufacturing Plant of Prepared Medicinal Herbs 1929 

9 Beijing Renwei Manufacturing Plant of Prepared Medicinal Herbs 1818 

10 Qijia Prepared Herbals Co. Ltd. of Heibei Province 1811 

1 1 The Prepared Herb Manufacturing Plant of Shanghai Jiabo Pharmacy 1810 

12 Hebei Meizhu Company of Chinese Materia Medica 1594 

13 The Sichuan Tibet Plateau Pharmaceuticals Co. Ltd. 1529 

14 The Beiqi Pharmaceuticals of the Greater Xing'an Mountains 1504 

15 Anguo Guangming Manufacturing Plant of Prepared Herbs 1384 

16 Shuangqiao Yanjing Manufacturing Plant of Prepared Medicinal Herbs 1 196 

17 Shanghai Yangpu Manufacturing Plant of Prepared Medicinal Herbs 1131 

18 Liuzhou Shennong Manufacturing Plant of Prepared Medicinal Herbs 945 



3.3. Status of TCM Manufacturing Enterprises 

In the 1950s, the professional factories making herbal drug products were situated in 
herbal pharmacies, with simple and crude equipment and poor preparation technology. 
Through technology innovation and transformation, production by these herbal drug 
manufacturers has increased tremendously. By the end of 1995, there were 1020 herbal 
drug enterprises in China (of which 158 were large and medium-sized) with a total 
output value of $2 billion USD (of which the large and medium-sized enterprises con- 
tributed about 60%) (76). In this industry, the conventional mode of production was 
gradually replaced by modern facilities and technology, and administrative and techni- 
cal standards have been greatly enhanced. With the reform of the TCM industry, the 
working environment, the level of technology and equipment, and the total quality 
control of products have been upgraded to CGMP standards in about two-thirds of the 
TCM manufacturers. Thus far, approx 8000 herbal drug products, prepared medicinal 
herbs, and natural health food products of different dosage forms, including pills, pal- 
lets, capsules, granules, syrups, injectables, topical creams and ointments, patches, 
aerosols, and so on have been supplied to the market, with consistently good quality, 
reasonably high curative effect, and few side effects. By the end of 2001, the total 
output value of the herbal drug industry reached $10-13 billion USD, and a number of 
pharmaceutical companies specializing in herbal drug product manufacturing and mar- 
keting have evolved (see Tables 5 and 6), with advanced processing technology, strong 
R&D, and effective marketing capabilities (62,77). 
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Table 5 

The Top 20 Chinese Herbal Drug Manufacturers in 2001 

No. Name and place Sales ($Million USD) 

1 Taiji Group Ltd., Chongqing 317 

2 Tianjin Zhong-xin Pharmaceuticals Ltd., Tianjin 262 

3 Shanghai Lei Yun Shang Pharmaceutical Co. Ltd., Shanghai 190 

4 Hui Ren Group Co. Ltd, Jiangxi Province 188 

5 Tianjin Tasly Group, Tianjin 164 

6 China Beijing Tong-ren-tang (Group) Co. Ltd., Beijing 137 

7 Xiu Zheng Pharmaceutical Co. Ltd., Jilin Province 125 

8 Chengdu Diao Group, Sichuan Province 106 

9 Chitai Qing Chun Bao Pharmaceutical Co., Zhejiang Province 94 

10 Zhenzhou Hua Xia Medical and Health Product Co. Ltd., Henan Province 92 

11 Shandong Lu Nan Pharmaceutical Inc., Shandong Province 83 

12 Shanghai Traditional Chinese Materia Medica Co., Shanghai 81 

13 Guilin San Jin Pharmaceuticals Co. Ltd., Guangxi Province 73 

14 Dong-E E-geltin Group Ltd., Shandong 65 

15 Tong Ren Tang Science and Technology Development Co. Ltd., Beijing 62 

16 Guangxi Jin Shang Zi Co. Ltd., Guangxi Province 60 

17 The Guangzhou First TCM Pharmaceutical Plant, Guangdong Province 59 

18 Shi Jia Zhuang Shenwei Pharmaceuticals Co. Ltd., Hebei Province 57 

19 Jiu Zhi Tang Inc., Hunan Province 48 

20 Qingdao Guofeng Pharmaceuticals Inc., Shandong Province 47 



Compared with 1999, annual sales in 2001 increased by 41.3%, whereas total profit 
value increased by 49.0%. There were eight herbal drug companies having a net income 
of more than 100 million Yuan (Table 7). There were 284 enterprises running deficits, 
accounting for 25.7% of the total number (62,76,77). 

One of the obvious problems with the herbal drug industry is that there are many 
small businesses (more than 70% of the total number of the TCM manufacturers) whose 
product pipelines and technology are out of date, and competitiveness and financial 
status are relatively poor. These small enterprises are becoming a main source of eco- 
nomic loss and employee layoffs in the TCM industry. 

3.4. Status of Plant Extracts 

China has recently become one of the world's largest medicinal plant exporters and 
importers. It is reported that China exported nearly 144,000 tons of medicinal plants 
annually from 1993 to 1998 (78). The medicinal plants were sold to more than 90 
foreign countries, such as the United States, the Republic of Korea, and Japan. Mean- 
while, China imported about 9200 tons of medicinal plants each year over the same 
period from more than 30 foreign countries. The demand for medicinal plants around 
the world has increased in recent years, as more and more people preferred natural 
therapies. Because the United States is the largest consumer of plant extracts (the mar- 
ket of diet supplements based on plant extracts reached $337 million in 2001, and the 
sales of the top 20 plants amounted to 96% of the market, see Table 8), the majority of 
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Table 6 

Comparison of Economic Indices for Herbal Drug Manufacturers in 1999-2001 

Fixed asset Output value Sales Total profit 

Year No. of enterprises (Smillion USD) ($million USD) (Smillion USD) (Smillion USD) 

1999 1033 2434 4807 4379 459 

2000 1072 2657 5982 5537 608 

2001 1104 2841 6621 6189 684 



extract products are sold to the United States. The quantity of plant extracts supplied 
from China accounts for 7% of the world market (78). 

Currently there are approx 200 companies manufacturing and supplying plant extract 
products; most are small businesses, and the largest has annual sales of less than $10 
million USD. There are only seven or eight suppliers capable of exporting over $4 
million USD products each year. Main products are extracts from various plants, includ- 
ing grape seed, pine bark, apple, astragalus, cat's claw, celery seed, chrysanthemum, 
corn silk, epimedium, ginkgo leaf, ginseng, giant knotweed, hawthorn leaf, red yeast 
rice, rhodiola, serrate clubmoss, shiitake mushroom, soy bean, tobacco, uniflower 
swisscentaury, wolfberry and so on (78). 

New drug formulations and delivery systems are also emerging. For example, 
nanoparticles (NPs) are composed of solid colloidal particles ranging in size from 1 to 
1000 nm (97), and their primary advantages are stability both in the body and in stock, 
site-specific targeting, and slow release. The application of NPs in herbal medicine 
should increase the effectiveness of herbs. 

3.5. Status of Preclinical and Clinical Research 

The experience of TCM needs to be substantiated and advanced by the available 
toolbox of science and technology. Quantitative and qualitative analysis, biological 
activity, bioavailability, absorption, metabolism, elimination, toxicity, and mode of 
action studies have been employed using either cell or animal models (113-115). 
Randomized controlled clinical trials and double-blind experiments have been done to 
address the question of efficacy (116-118). A detailed warning label should be issued to 
notify consumers and medical practitioners about potential harm and assure that the 
claimed health benefits were really there (119). 

4. The Internationalization or Globalization of TCM 

TCM constitutes a multi-billion-dollar industry worldwide, and more than 1500 
herbals are sold as dietary supplements or ethnic traditional medicines. Table 9 lists 
the exports of herbal drug products and Chinese material medica, including plant 
extracts, from 1990 to 2001 (62). However, the great potential of TCM has not been 
reached. There are many challenges in the course of herbal product internationaliza- 
tion, and the entire industry is making a great effort to advance into the international 
market. 
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Table 7 

Most Profitable Herbal Drug Manufacturers in 2001 

No. Name of the manufacturers Profit ($million USD) 

1 Chengdu Diao Group 29.6 

2 Tianjin Tasly Group 28.8 

3 Xiu Zheng Pharmaceuticals Group Ltd. 28.3 

4 China Beijing Tong Ren Tang (Group) Co. Ltd. 27.7 

5 Chitai Qing Chun Bao Pharmaceuticals 22.6 

6 Gong-E E-gelatin Group Ltd. 17.1 

7 Guilin San Jin Pharmaceuticals Co. Ltd. 14.4 

8 Jilin Au Dong Group Ltd. 13.8 

9 Tong Ren Tang Sci. & Tech. Development Co. Ltd. 11.9 

10 Tianjin Zhong-xin Pharmaceuticals Ltd. 11.3 

11 Jiang Zhong Pharmaceutical Co. Ltd. 10.5 

12 Tonghua Jinma Pharmaceutical Co. Ltd. 10.3 

13 Shandong Lu Nan Pharmaceutical Inc. 10.1 

14 Zhangzhou Pian Zai Huang Pharmaceutical Co. Ltd. 10.1 

15 Tibet Qi Zheng Pharmaceutical Co. 9.4 

16 Guangxi Jin Shang Zi Co. Ltd. 9.4 

17 Yunnan Bai Yao Pharmaceuticals Group Ltd. 9.3 

18 Tianjin Guang Xia Group Ltd. 9.3 

19 Zhejiang Kang Eng Bei Group Ltd. 9.2 

20 The Guangzhou First TCM Pharmaceutical Plant 9.0 



4. 1. Open Dialog and Mutual Understanding Among All Parties Involved 
in Health Care System 

Western medicine is built on reproducible experiments and statistical analysis, 
whereas CAM and TCM are built on clinical experience. Compared with conventional 
Western medicine, TCM is poorly researched. Many studies in TCM therapies have 
flaws, such as insufficient statistical power, poor controls, inconsistent treatment, and 
lack of comparisons. TCM is one of the oldest evidence-based alternative and comple- 
mentary therapies, and its formulations are often not subjected to premarket toxicity 
testing. With the booming market of TCM worldwide, a new strategy must be formu- 
lated for the assessment of drug efficacy and toxicity to optimize the therapeutic and 
preventive potential of Chinese herbal medicine. 

The validation of TCM theory is perhaps more important than the commercializa- 
tion of herbal drugs in the international market. The international standardization of 
traditional medical terms and practices will facilitate worldwide acceptance of TCM 
products. The safe and effective application of TCM has much to do with the skills of 
traditional medical practitioners as well as the basic understanding of the mechanism 
of TCM. As an example, Ginkgo biloba is a dioecious tree with a long history of TCM. 
The standardized extracts of its leaves have been used widely as a phytomedicine in 
Europe and as a dietary supplement in the United States. The primary active constitu- 
ents of its leaves include flavonoid glycosides and unique diterpenes known as 
ginkgolides; the latter are potent inhibitors of platelet activating factor (94). Clinical 
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Table 8 

Sales of 20 Herbal Extracts in International Mmarket in 2002 



Rank 


Botanical name 


Active ingredients 


Sales ($million USD) 


1 


Gingko 


Flavones/Lactones 


5.8 


2 


Echinacea 


Total phenolics 


5.0 


3 


Garlic 


Alicins 


4.4 


4 


Ginseng 


Saponins 


3.9 


5 


Soy bean 


Isoflavones 


3.5 


6 


Saw Palmetto 


Fatty acids 


3.1 


7 


St. John's wort 


Hypericins 


3.0 


8 


Valerian 


Valeric acids 


1.5 


9 


Bilberry 


Anthocyanidins 


1.3 


10 


Black Cohosh 


Triterpenes 


1.2 


11 


Kava Kava 


Lactones 


1.2 


12 


Milk Fruit 


Silymarins 


0.9 


13 


Evening primrose 


Oils 


0.7 


14 


Grape seed 


Proanthocyanidins 


0.5 


15 


Yohimbin 


Alkaloids 


0.3 


16 


Green tea 


Polyphenols 


0.2 


17 


Pycnogenol Pine 


Polyphenols 


0.2 


18 


Rhodiola rosea 


Salidrosides, rosavins 


0.2 


19 


Ginger 


Gingerols 


0.2 


20 


Pyrethrum 


Total flavones 


0.1 


Total 






41.9 



studies have shown that ginkgo extracts exhibit therapeutic effects in a variety of con- 
ditions, including Alzheimer's disease, memory loss, age-related dementia, cerebral 
and ocular blood flow occlusion, premenstrual problems, and altitude sickness (95). As 
a result of its potent antioxidant properties and ability to enhance peripheral and cere- 
bral circulation, ginkgo shows prospective value in the treatment of cerebrovascular 
dysfunctions and peripheral vascular disorders (96). 

4.2. Deciphering the Preventive Nature of TCM 

A very important philosophy for TCM is to cure disease before it happens — the 
prototype of preventive medicine (98-101). Three functional levels of preventive medi- 
cine have been mentioned in the past: preventing the occurrence of disease and injury; 
early detection and intervention, by reversing, halting, or retarding the progression of a 
disease; minimizing the effects of disease and disability by surveillance and mainte- 
nance to prevent complications. 

As an example, cardiovascular disease is a complex and multifactorial disease, 
characterized by such factors as high cholesterol, hypertension, reduced fibrinolysis, 
increase in blood-clotting time, and increased platelet aggregation. Evidence from 
numerous studies points to the fact that garlic can bring about the normalization of 
plasma lipids, enhancement of fibrinolytic activity, inhibition of platelet aggregation, 
and reduction of blood pressure and glucose (120). Drug developers in the oncology 
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Table 9 

Annual Export of Herbal Products From 1990 to 2001 



Year 


Chinese material medica 


Herbal drug products 


Nation's total exports 




(in $1000 USD) 


(in $1000 USD) 


(in $million USD) 


1990 


304,150 


112,870 


62,091 


1991 


241,250 


126,680 


71,843 


1992 


241,340 


135,280 


84,940 


1993 


329,190 


120,910 


91,744 


1994 


537,210 


130,260 


121,006 


1995 


537,010 


135,350 


148,780 


1996 


443,570 


126,000 


151,048 


1997 


424,650 


107,520 


182,792 


1998 


332,130 


84,520 


183,712 


1999 


295,170 


73,870 


194,931 


2000 


348,990 


90,940 


249,212 


2001 


354,000 


102,000 


266,154 



arena have much to learn from their cardiovascular counterparts. The quest for primary 
and secondary cancer prevention has been ongoing for some years. One of the most 
debated is the anti-inflammatory drug COX-2 for colorectal carcinoma and other types 
of cancer. Other examples include hormonal therapy for prevention of prostate and 
breast cancer. Tamoxifen, a selective estrogen receptor modulator (SERM), was actu- 
ally approved for cancer prevention for women with high risk of breast cancer in 1998, 
thus being the first cancer prevention drug ever approved by the FDA (121-127). 

The further validation of preventive medicine is awaiting the advancement of bio- 
statistics, epidemiology, gene biomarker identification, and other diagnostic tools in 
disease initiation and progression. One promising example of cancer-preventive effects 
that are not specific to any organ is Panax ginseng, an herb with a long medicinal history. 
The genus name of ginseng, Panax, is derived from the Greek pan (all) akos (cure), 
meaning "cure-all." No single herb can be considered a panacea, but ginseng comes 
close to it. Ginseng is a tonic herb, or an adaptogen that helps to improve overall health 
and restore the body to balance, and helps the body to heal by itself. Its protective 
influence against cancer has been shown by extensive preclinical and epidemiological 
studies (89,90). Ginseng is a slow-growing perennial herb, reaching about 2 ft in height. 
The older the root, the greater is the concentration of ginsenosides, the active chemical 
compounds; thus the ginseng becomes more potent with time. More than 28 ginsenosides 
have been extracted from ginseng, and might be associated with a wide range of thera- 
peutic actions in the central nervous system (CNS) and cardiovascular and endocrine 
systems (91). Indeed, ginseng promotes immune function and metabolism, and pos- 
sesses antistress and anti-aging activities. Several ginsenosides were proven to be non- 
organ-specific tumor suppressors and to improve learning and memory in patients with 
Alzheimer's disease (91-93). 
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4.3. Respect and Protect Intellectual Property 

The Dietary Supplement Act of 1994 opened the door for a surge of products into 
the nutraceutical marketplace (86). However, maintaining exclusive market rights for 
any natural product remains challenging. Many people in the natural-products indus- 
try perceive few opportunities to obtain defensible patents for natural products that 
have been used in the public domain for many years. Thus, with little chance to achieve 
market exclusivity and without a requirement for premarket approval, companies have 
little incentive to conduct expensive clinical trials of dietary supplements. However, 
opportunities do exist with respect to identifying new active ingredients, improving 
methodologies of secondary metabolites production, applying modern technology to 
discover new modes of action of TCM (102), and fingerprinting the new natural prod- 
uct remedy. Therefore, it is wise to seek multiple facets of intellectual property protec- 
tion. The most common form for a new natural-product remedy is patent protection. A 
patentable invention needs to be novel, useful, and nonobvious (inventive). In some 
instances, these remedies may qualify for protection as trade secrets. With the exist- 
ence of many manufacturers of the same type of product, creating a brand name that 
can assure high quality may be the ultimate challenge. 

4.4. Safety Issues of TCM Need To Be Seriously Evaluated 

Widespread favor of TCM also brings serious concern about its safety, regulation, 
efficacy, and mode of action (103,104). In some cases, the use of TCM has been con- 
nected to many undesirable side effects, resulting in nephropathy, acute hepatitis, coma, 
and so on (105-108). Cases of poisoning have been linked to variations in the chemical 
composition of different brands of the same herb. Such differences may arise as the 
result of inadequate processing (processes normally involve soaking and boiling the 
raw material), resulting in toxins being retained or adulteration with cheaper substi- 
tutes (109-111). Cases of poisoning may also be attributed to contamination by heavy 
metals (112). 

TCM is guided and supported by medical theory. There will be serious consequences 
if the drugs are used but the traditional medical theory is discarded. People in many 
countries are using Chinese herbal medicines as daily diet supplements without having 
a basic understanding of Chinese medical theory or the rationale behind their use. One 
of the most serious examples is the case of xiao-chai-hu-tang (decoction of bupleuri for 
regulating Shaoyang), observed in Japan. This ancient formula was used to treat febrile 
diseases in Shaoyang meridian with symptoms of alternate attacks of chills and fever, 
fullness in the chest, discomfort, dizziness, dry throat, vomiting, and so on (80). Based 
on modern pharmacological findings, some TCM practitioners used such a formula to 
treat hepatitis specifically showing the above symptoms and obtained reasonably good 
results. However, the formula was widely prescribed in Japan for the long-term treat- 
ment of all types of hepatitis, and unfortunately, resulted in severe adverse effects, 
including a number of deaths (81). 

It was recently reported that a specific type of nephropathy occurred due to inges- 
tion of certain Chinese herbs such as Aristolochiae manshuriensis. The case highlighted 
the role of aristolochic acid and its metabolite, aristololactam I, in causing this nephr- 
opathy, which was first observed in a Belgian cohort (82). The phenomenon, now called 
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"Chinese herbal nephropathy," led to a public misconception of the toxic nature of 
TCM. While further research on aristolochic acid-containing herbs is being conducted 
in China (83), we would like to make the following points: In ancient times, the herbs 
Mu-tong (Akebia quinata), San-ye-mu-tong {Akebia trifoliate), and Bai-mu-tong 
(Akebia quinata var. australis) were used in combination formulas rather than the herb 
Guan-mu-tong (Aristolochiae manshuriensis). The three herbs used in ancient formu- 
las have no aristolochic acid and have been replaced nowadays with Guan-mu-tong 
due to much decreased resources. As a result, they were no longer collected in the 
Chinese Pharmacopoeia 2000. Such a replacement greatly altered the toxicology of 
these Mu-tong-containing formulas. Secondly, the preparation of many TCMs nowa- 
days has been greatly modified for convenience, i.e., the drug decoction process is 
replaced with "instant" water-soluble dosage forms of herbal extracts. This has mini- 
mized chemical interactions including drug detoxification among herbal medicines. 
Additionally, the use of a single herbal medicine, or refined fractions in large quantity, 
may exhibit certain therapeutic activities, but are often more toxic and less efficacious 
than combination formulas. Any use of herbal products in high dosage or for long-term 
medication is not advisable unless patients see a practitioner for a diagnosis and holis- 
tic approach to their conditions and obtain a customized and personalized prescription, 
taking the various manifestations of their symptoms and perceived causes into full 
account. In fact, a carefully prepared combination formula works rather differently, 
because it contains synergistic and balanced elements that may interact in different 
ways and neutralize the negative effects of some toxic constituents that the plants might 
contain. This is the hypothesis, but the public is entitled to have medicines that are 
proven to work by rigorous tests and that are safe and cost-effective. The standards and 
the criteria for judging the safety and the effectiveness of treatment and diagnostic 
interventions must be formulated by critical scientific evaluation from practitioners 
with different epistemology. Satisfaction from patients using both Western medicine 
and TCM has been scored and compared in Korea (26). 

4.5. The Small Scale and Limited Capability of Herbal Drug Enterprises 

Most Chinese pharmaceutical companies specialized in herbal drug manufacturing 
and marketing are currently very weak in terms of their product quality, proprietary 
technology, international sales, and overall competitiveness. Those TCM enterprises 
are also weak in terms of sales, profit, and equity capital in the international pharma- 
ceutical industry. China's largest herbal drug company, Taiji Group (Table 5), has 
annual sales of only $300 million USD. Such herbal drug companies will have finan- 
cial and technical difficulties in the development of international business. The R&D 
investment by these companies supported by their revenues will be limited. This vicious 
circle will prevent them from experiencing rapid and sustainable growth with a well- 
structured product pipeline (79). At present, the majority of manufacturers primarily 
produce products in traditional dosage forms, such as large spherical pills consisting of 
powders of raw materials which are not subjected to sophisticated extraction processes 
and are inconsistent in quality. 

There are more than 1000 companies in China making TCM, but large and medium- 
sized enterprises account for less than 20% of the total business players in the Chinese 
market. There are too many companies scrambling into the same business with almost 



Chinese Herbal Drug Industry 245 

the same strategies, leading to a glut in production that reduces their profit. A limited 
high-tech content in the production of TCM has also prevented China from competing 
effectively on the world stage (79). Merger and acquisition should take place to create 
firms large enough to be cost-effective. 

5. Closing Remarks 

Historical precedent predicts the potential of TCM to expand the health-care reper- 
toire, either as single-chemical entities or as complex botanical drugs. With an increase 
in the aging population and changes in the epidemiology of health problems (such as 
chronic and degenerative diseases) that have frustrated Western medicine, TCM and 
CAM will gain more popularity in the new millennium. Most survey results in Western 
countries showed there is considerable interest in TCM and CAM among primary-care 
professionals, and many are already referring or suggesting referrals. Such referrals are 
driven mainly by patient demand and by dissatisfaction with the results of conven- 
tional medicine. The trend of integrating TCM and CAM into mainstream primary care 
is unstoppable. There is an urgent need to further educate and inform primary-care 
health professionals about TCM and CAM, which bring unprecedented opportunities 
and challenges to TCM specifically. Modernization and globalization of TCM is the 
right way to go. We strongly believe that the future of TCM as a healing art is promis- 
ing, with its success in research and education and, very importantly, in commercial- 
ization in the international market. With the further development of systems biology, 
pharmacogenomics, synergistic medicine, and personalized medicine, the real time for 
integration of TCM and Western medicine will come. It is hopeful that some day pa- 
tients will benefit from an integrated medicine — wisdom combining the strengths of 
both the East and the West. 
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Summary 

Cumulative evidence indicates that traditional Chinese medicine (TCM) is an important resource 
for discoveries of drugs against cancer. Arsenic, a common, naturally existing substance, is rarely found 
in its pure elemental state in nature. In addition to the organic arsenicals, there are three major 
inorganic arsenic forms: red arsenic (As 2 S 2 ), yellow arsenic (As 2 S 3 ), and white arsenic (As 2 3 , ATO). 
Based on the basic TCM theory of "using poison against poison," a group from Harbin Medical 
University in the northeastern region of China in the early 1970s introduced intravenous infusion of 
"Ailing- 1 (anticancer- 1)," a solution of crude ATO and herbal extracts, into cancer therapy. After a 
lengthy study in more than 1000 patients with various kinds of cancers, acute promyelocytic leukemia 
(APL), a specific subtype of acute myeloid leukemia characterized by the failure of differentiation/ 
maturation towards granulocytic cells at the promyelocytic stage, was determined to be an excellent 
target for "Ailing- 1" therapy. At the same time, there were reports of using the formula called "niu 
huang jie du pian" (containing xiong huang) or just using xiong huang (realgar) by itself to treat APL. 
A formula of Qing Dai (indigo) containing xiong huang as the main Chinese herb was reported to 
result in 98.3% total remission. 

Thus, some cautious clinical trials with pure ATO and state-of-the-art bench studies were 
conducted worldwide. The results revealed that ATO brings the majority of relapsed APL patients to 
a second complete morphologic, cytogenetic, and even molecular remission. Successful experiences 
that used ATO together with the differentiation-inducing agent all-trans retinoic acid (ATRA) and 
chemotherapeutic drugs make APL the most curable subtype of acute myeloid leukemia in adults, an 
unprecedented achievement in the field of hematologic malignancies. The clinical data from our 
group showing striking efficacy and safety of ATO in patients with APL led to clinical trials in the 
United States and subsequent approval by US Food and Drug Administration (FDA) in September 
2000. Inspired by such a bedside discovery, many investigators are researching pharmacological 
mechanisms of ATO, and finding that ATO exerts wide-spectrum cellular and molecular activities on 
APL cells and other cancer cells, such as growth arrest, apoptosis induction, differentiation induction, 
and anti-angiogenesis, although its exact mechanisms remain to be determined. More importantly, the 
clinical potential of ATO in other cancers beyond APL is also under study. In this chapter, we discuss 
recent clinical practice and the mechanism of ATO action on APL and other cancer cells. 

Key Words: Arsenic trioxide; acute promyelocytic leukemia (APL); clinical trials; differen- 
tiation; apoptosis; leukemia; cancer. 

1. Introduction 

Microbial and plant secondary metabolites helped to double our life span during the 
twentieth century, reduced pain and suffering, and revolutionized medicine. For instance, 

From: Natural Products: Drug Discovery and Therapeutic Medicine 
Edited by: L Zhang and A. L. Demain © Humana Press Inc., Totowa, NJ 

25 1 



252 Chen et al. 

all-trans retinoic acid (ATRA), a vitamin A derivative, has been successfully used for 
acute promyelocytic leukemia (APL) by inducing terminal differentiation of leukemic 
cells towards granulocytes, the only well-established example of cancer differentiation 
therapy so far (1,2). In the past 20 yr, a therapeutic strategy with ATRA and 
anthracycline-based chemotherapy for induction, anthracycline-based consolidation, 
and maintenance with ATRA and/or low-dose chemotherapy has kept more than 70% 
of APL patients alive and disease-free for 5 yr, an unprecedented achievement in the 
field of oncology (3). 

Increasing evidence demonstrated that herbs-based traditional Chinese medicine 
(TCM) could be used for treatment of many diseases, including cancer. As documented, 
some Chinese herbs, such as Qing Dai (Indigo naturalis), Ya Dai Zi (Brucea javanica), 
She Xiang (Moschus moschiferus), and E Shu (rhizoma Curcuma phaeocaulis), can 
directly kill cancer cells. Also, other herbs can inhibit cancer cell growth, such as Jiang 
Huang (Curcuma aromatica), Chang Chun Teng (Hedera helix), Tian Hua Feng 
(Trichosanthes kirilowii), Tian Nan Xing (Arisaemi japonicum), and Shan Zi Gu 
(Cremastra variabilis) (4,5). In addition, up to 100 kinds of Chinese herbs can stimu- 
late the patient's immune system to attack cancer cells (6). Taken together, Chinese 
herbs are becoming important resources for anticancer drug discovery (7). 

Chinese herbs are often mixed into formulae for clinical application. For example, 
the formula "Qing Huang San" (one of the popular formulae that grinds two herbs, 
Qing Dai and Xiong Huang, at a ratio of 9:1 into fine powder, which is loaded into 
capsules) was shown to effectively treat chronic myeloid leukemia (CML) (8). In 1995, 
Huang et al. from Da Lian city, China, used a formula combining these two herbs as 
main ingredients to treat 60 APL patients, which resulted in 98.3% experiencing total 
remission (9). Before that, there were reports of using a formula called "niu huang jie 
du pian" (containing Xiong Huang) or just using Xiong Huang (realgar) by itself to 
treat APL (8). 

APL, an unique subtype of acute myeloid leukemia (AML), is characterized by the 
blockage of granulocytic differentiation at the promyelocytic stage and specific recip- 
rocal chromosomal translocation t(15;17)(q22;q21). The latter leads to the expression 
of the fusion protein PML-RARoc (promyelocytic leukemia — retinoic acid receptor- 
ex), whose leukemogenic role has been demonstrated using transgenic mouse models 
(10). Although conventional chemotherapy such as anthracyclines and cytosine arabi- 
noside help about 70% of newly diagnosed APL patients to reach clinically complete 
remission (CR), a high frequency of early death (mainly due to exacerbation of bleed- 
ing syndrome) and a lower 5-yr disease-free survival (DFS) rate have emphasized a 
need for new drugs. Inspired by the Chinese philosophy that it is better to transform 
and reverse a bad element than to simply get rid of it, and by important discoveries in 
the 1970s and early 1980s that leukemic cells undergo phenotypic reversion in vitro 
when treated with some agents, the Shanghai Institute of Hematology (SIH) stalled to 
screen a large number of compounds. SIH identified ATRA as a strong differentiation- 
inducing agent for APL cells, and improved clinical efficacy as mentioned above. How- 
ever, there is still a 30% relapse rate, especially in APL patients having higher white 
blood cells (WBC) counts. In addition, relapsed patients after ATRA treatment are 
resistant to this drug, which becomes the main reason for failure to be cured (11,12). 
Fortunately, arsenic trioxide (As 2 3 , ATO, commercially named Trisenox™ [Cell 
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Therapeutics, Inc., Seattle, WA]) made the majority of relapsed and newly diagnostic 
APL patients achieve complete morphologic, cytogenetic, and even molecular remis- 
sion (13). Such an exciting ingredient, which was rapidly approved by the State Drug 
Administration (SDA) in China (13) and the Food and Drug Administration (FDA) of 
the United States (14), encouraged many investigators to focus on mechanisms of action 
of ATO on APL and to explore the potentials of ATO in treatment of cancers beyond 
APL. In this chapter, we shall concentrate on recent clinical practice and the mecha- 
nism of action of ATO on APL and other cancer cells. 

2. Historic Overview of Medicinal Arsenicals 

Arsenic, a common, naturally occurring substance, is rarely found in its pure elemen- 
tal state in nature. In addition to the organic arsenicals formed by covalent linking of a 
trivalent or pentavalent arsenic atom to a carbon atom, there are three major inor- 
ganic arsenic forms: red arsenic (As 2 S 2 , also named realgar or sandaraca), yellow 
arsenic (As 2 S 3 , referred to as arsenikon, aurum pigmentum, or orpiment), and white 
arsenic (ATO). That arsenic is a poison that has been known for many centuries. Its 
odorless and tasteless properties and acute/chronic poisoning has resulted in symptoms 
confused with a variety of other natural disorders, such as hemorrhagic gastroenteritis, 
cardiac arrhythmias, and psychiatric disease, which has made arsenic a common homi- 
cide or suicide tool since the Middle Ages. For example, Napoleon might have been 
poisoned by arsenic-tainted wine that was served to him in exile (15). At present, long- 
term exposure to arsenic is still a serious public health problem in some parts of the 
world, especially in developing countries. It has been estimated that tens of millions of 
people are at high risk of being exposed to excessive levels of arsenic from both con- 
taminated water and arsenic-bearing coal from natural sources (16). Apart from the 
poisoning, arsenic exposure has long been known to be correlated with human skin, 
lung, liver, kidney and urinary bladder cancers, although arsenic itself has not been 
shown to be carcinogenic in animal models to date. 

Paradoxically, arsenic usage has been medicinally practiced for 24 centuries, pro- 
viding a history of utility, dishonor, and redemption (for details, see ref. 10). Since the 
days of the ancient Greek and Roman civilizations, arsenic has been an active ingredi- 
ent in folk remedies in central and southern Asia. In the well-known Ben Cao Gang Mu 
(Compendium of Materia Medica) compiled by Herbalist Li Shizhen in the Ming 
Dynasty of China, arsenic was recorded as a treatment for several diseases. In the 
eighteenth century, Dr. William Withering (1741-1799) found that one of his patients 
with dropsy (congestive heart failure) improved remarkably after taking a traditional 
herbal remedy. He discovered that the active ingredient was digitalis, which was con- 
tained in the leaves of foxglove. His comment, "Poisons in small doses are the best 
medicines; and the best medicines in too large doses are poisonous," made him a strong 
proponent of arsenic-based therapies (17). Many arsenic preparations were used 
therapeutically in the 18th century. At that time, Thomas Fowler compounded a potas- 
sium bicarbonate-based solution of ATO named Fowler's solution to treat a variety of 
diseases, including asthma, chorea, eczema, and psoriasis. The solution became a stan- 
dard remedy to treat anemia, Hodgkin's disease, and leukemia until the early 20th cen- 
tury (17). Arsenic's reputation as a therapeutic agent was further explored in 1910 
when Nobel laureate and the founder of chemotherapy Paul Ehrlich developed 
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salvarsan, an organic arsenical, to treat syphilis, which is still used for trypanosomiasis 
today (17). Taken together, arsenic was one of the mainstays of the materia medica in 
the 19th century. 

Arsenic's antileukemic activity was first reported in the late 1800s. In 1878, a report 
from Boston City Hospital described the effect of Fowler's solution on the reduction of 
WBC counts in two healthy persons and one patient with "leukocythemia" (18). After 
discontinuation of treatment, the leukocyte count progressively increased until arsenic 
therapy was reinstituted. Arsenic then experienced a brief resurgence in popularity 
following a report in 193 1 of nine CML patients who responded well to ATO therapy. 
They experienced a reduction in total WBC counts and in the size of enlarged livers 
and spleens, a return to apparently normal hematopoiesis in bone marrow (BM) biopsy 
specimens, and a sense of well-being (18). However, Kandel et al. (19) reported the 
development of chronic arsenic poisoning in five of six CML patients and recom- 
mended careful monitoring with its application. Thereafter, the utility of Fowler's 
solution progressively declined, and it was supplanted by radiotherapy and cytotoxic 
chemotherapy. 

Nevertheless, a group from Harbin Medical University, in the northeastern region of 
China, has been challenging its disuse. They deeply believed in the basic theory of 
TCM, i.e., "using poison against poison," and determined that the possible adverse 
effects should not prevent physicians from applying arsenic to treat patients with life- 
threatening diseases. Actually, most chemotherapeutic agents in use today are 
genotoxic and carcinogenic. In the early 1970s, they re-introduced intravenous infu- 
sion of Ailing- 1 (anticancer- 1), a solution of crude ATO and herbal extracts, into can- 
cer therapy. After a lengthy study in more than 1000 patients with various kinds of 
cancer, APL was determined to be an excellent target for Ailing-1 therapy. In 1992, 
they revealed their preliminary clinical trials on APL patients, showing clinically CR 
was achieved in 21 out of 32 (65.6%) patients, with 5- and 10-yr survival rates of 
50.0% and 28.2%, respectively (20). Since 1994, a careful clinical trial with pure ATO 
and state-of-the-art bench studies conducted at SIH in collaboration with Harbin's 
group made the remarkable achievement well known in the world (21,22). Since then, 
these results have been confirmed in randomized clinical trials in the United States and 
other countries (23-26), resulting in a new era of medicinal arsenic. 

3. Clinical Experiences With Arsenic in the Treatment of APL 

3. 1. Pharmacokinetics 

In order to understand pharmacological mechanisms and potential toxicity of ATO 
in the treatment of APL, pharmacokinetic analysis in eight relapsed APL patients re- 
ceiving intravenous infusions of the present standard dose (0.16 mg/kg/d) of ATO were 
performed in our group (27). The results revealed that the plasma arsenic concentration 
rapidly reached the peak level of 6.85 (range, 5.54 to 7.30) umol/L of the mean maxi- 
mal plasma concentration (C pmax ). In one Japanese APL patient who developed BM 
necrosis during ATO treatment, the C pmax and half-life time was 6.9 umol/L and 3.2 h, 
respectively (28). The mean C pmax in six APL patients given a low dose of ATO (0.08 
mg/kg/d) fell to 2.63 (range, 1 .54 to 3.42) umol/L, which was nearly half the level of a 
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Table 1 

CR Rate in APL Treated With ATO in Some Clinical Trials 



Reference 


Status of the disease 


yV 


CR rate (%) 


Sun et al. (20) 


Primary + relapsed 


32 


65.6 


Zhang et al. (32) 


Primary 


30 


73.3 




Relapsed 


42 


52.4 


Shen et al. (27) 


Relapsed 


15 


93.8 


Soignet et al. (23) 


Relapsed + refractory 


12 


91.7 


Niu et al. (34) 


Primary 


11 


72.4 




Relapsed 


47 


85.1 


Soignet et al. (24) 


Relapsed + refractory 


40 


85.0 


Zhang et al. (33) 


Primary 


124 


87.9 




Relapsed + refractory 


118 


61.0 


Shen et al. (29) 


Relapsed 


20 


80.0 


Lazo et al. (26) 


Relapsed 


12 


100 



CR, complete remission. 



standard dose (29). Furthermore, continuous administration of ATO over 30 d did not 
change its pharmacokinetic behavior. It was very different from the observations with 
ATRA, the C pmax of which decreased progressively with continuous usage (30). How- 
ever, most of the time during treatment with either a standard or a low dose, the plasma 
concentration was maintained in the range of 0.1 to 0.5 umol/L. Of note was the report 
that the C pmax of ATO (median 0.46 umol/L) was slowly achieved at 8-15 d with a 
standard dose in a phase II trial of an ATO/interferon (IFN)-oc combination in seven 
patients with relapsed/refractory adult T-cell leukemia/lymphoma (ATL) (31). In both 
standard and low-dose groups, moreover, urinary arsenic content was slightly increased 
during drug administration, and the total amount of arsenic excreted daily in the urine 
accounted for less than one-tenth of the total daily dose. Arsenic accumulated gradu- 
ally in the hair and nails, and the peak level at CR stage was five to seven times higher 
than prior to treatment. However, the amount of arsenic in hair and nails tended to 
decrease after withdrawal of the drug (27,29). 

3.2. Higher Complete Remission Rate 

In 1997, we reported that 93.8% (14/15) of relapsed APL patients obtained CR with 
an intravenous drip of pure ATO at 0.16 mg/kg/d, which was diluted in 250 mL to 500 
mL of a 5% glucose-normal saline solution (27). Later, a series of clinical trials were 
carried out in various countries and regions, and a similar CR rate was obtained 
(Table 1). For instance, the report from Memorial Sloan Kettering Cancer Center in 
New York revealed that out of 12 relapsed APL patients receiving ATO, 1 1 got CR. It 
is worthwhile to note that treating 20 cases of relapsed APL with low-dose ATO (0.08 
mg/kg/d) yielded a similar CR rate (80%) (29). 

It should be mentioned that some groups, including ours (25,34), tried to use ATO to 
treat newly diagnosed APL patients and obtained similar CR rates. For example, 14 
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newly diagnosed patients were recruited to an ongoing trial of Mathews's group, dur- 
ing which ATO was administered at a dose of 10 mg/d until CR was achieved. After- 
wards, a consolidation course and a maintenance schedule consisting of ATO as a single 
agent were administered over 6 mo. Exclusive of three early deaths (two on d 3 and one 
on d 4) related to intracerebral hemorrhage and one patient who died on d 21 secondary 
to uncontrolled sepsis, 10 of the remaining 11 patients (91%) attained CR with the 
average time to CR of 52.3 d (range: 34-70 d). This included one unfortunate patient 
who developed an isolated central nervous system (CNS) relapse and subsequently 
went into a second CR following therapy with triple intrathecal chemotherapy, cranial 
irradiation, and an additional 4-wk course of systemic ATO (25). 

3.3. Correction of Coagulopathy 

APL cells contain several distinct procoagulant factors, including tissue factor (TF) 
and cancer procoagulants as well as plasminogen activators (t-PA, u-PA). These fac- 
tors induce generation of thrombin and activation of plasmin, resulting in disseminated 
intravascular coagulation and hyperfibrinolysis respectively, which are aggravated by 
chemotherapeutic drugs (35). Similar to ATRA treatment, ATO induction can improve 
the bleeding diathesis and coagulation-test abnormalities. Those are manifested by 
decreased soluble fibrin monomer complex, thrombin-antithrombin complex, fibrino- 
gen/fibrin degradation products, and D-dimer, and increased fibrinogen and plasmino- 
gen in the plasma (36,37). Recent studies have confirmed that ATO inhibited the 
aberrant expression of TF at the transcriptional level, in parallel to the improvement in 
hemorrhagic syndrome during ATO treatment (36,38). 

3.4. Low Toxicity 

A few patients developed BM depression and significant cross-resistance to the cur- 
rently used drugs for APL. But upon treatment with ATO, about 70% of the patients 
presented with increased WBC counts, and hyperleucocytosis with signs mimicking 
retinoic acid syndrome was reported in one-third to one-half of the patients (27,32, 
33,39,40). This new "arsenic syndrome" responded to dexamethasone treatment. More 
surprisingly, although arsenic is known as a poison, most patients could tolerate this 
treatment. ATO-related adverse effects, most of which were mild, included skin reac- 
tions (dryness, itching, erythematous lesion and pigmentation), gastrointestinal distur- 
bance with loss of appetite, nausea, vomiting and diarrhea, peripheral neuropathy with 
dysesthesia, joint and muscle pain, facial edema, ECG changes with low-flat T wave, 
sinusal tachycardia, or grade 1 atrioventricular blockage. Thirty percent or so devel- 
oped impaired liver function with elevation of AST, ALT, alkaline phosphatase, GGT, 
or total bilirubin (14,20,23,24,33,34,41^43). Other less frequent adverse effects were 
enlargement of the salivary gland, enlarged thyroid gland without hyperthyroidism, 
oral ulcer, toothache, and gingival or nose bleeding. However, one should be cautious 
in interpreting the unexplained BM necrosis after a daily dose of 0.15 mg/kg ATO 
therapy in a 60-yr-old Japanese male with refractory APL (28). ATO may also suppress 
cell-mediated immunity by inducing apoptosis of T-helper lymphocytes and exert the 
immunosuppressive effect of ATO in vivo. That was activated by an occurrence of her- 
pes zoster during ATO treatment in two patients who were already in remission and 
received ATO as a consolidation treatment (44). These mild side effects can be pre- 
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vented or managed successfully with careful patient monitoring during treatment. 
Tanvetyanon and Nand have proposed that early antiviral treatment might help avoid 
such complications as postherpetic neuralgia (44). It is worth noting that hepatotoxic- 
ity seemed to be higher in newly diagnosed cases in our experience (34). 

3.5. Survival Duration and Postremission Treatment 

In the standard-dose group, among 33 relapsed patients followed up for 7 to 48 mo 
after CR in our institute (34), the estimated disease-free survival (DFS) rate and overall 
survival (OS) rate at 2 yr were 49.11 + 15.09% and 61.55 + 15.79%, respectively. 
These results showed no statistically significant superiority to those obtained among 
14 relapsed patients followed up for 7 to 33 mo on the low dose (29). Hence, low-dose 
ATO treatment seems to be equally effective in APL patients, although a larger-scale 
randomized study should be launched to reach a definitive conclusion. Soignet et al. 
(23) reported that after two courses of ATO therapy, 27 out of 37 patients obtained 
molecular remission, i.e., APL -related PML-RARoc fusion gene transcripts were nega- 
tive by RT-PCR assay. Similar findings were also reported in studies at M. D. Ander- 
son (26), and other small-range studies (45) suggest that molecular remission may be 
achieved at the time of CR in some APL patients with durable DFS. 

3.6. Combination of ATO With ATRA on APL 

Although some reports on the in vitro joint effects of ATRA and ATO are controver- 
sial, several lines of evidence support a strong synergistic effect between ATRA and 
ATO, supporting the feasibility of their combination in clinical application for newly 
diagnosed APL. In a randomized study of ATO alone vs a combination of ATO and 
ATRA in 20 relapsed APL patients, all previously treated with ATRA-containing che- 
motherapy between 1998 and 2001, Raffoux et al. (46) found that the CR rate (80%), 
the hematologic and molecular responses, the time necessary to reach CR, and the 
outcome were comparable in both treatment groups. This led to a conclusion that ATRA 
did not significantly improve the response to ATO in relapsed patients. In our recent 
study (47), 61 newly diagnosed APL subjects were randomized into three treatment 
groups — ATRA, ATO, and the combination of the two drugs. The results revealed that 
all three groups reached a high CR rate of >90%, among which the combination group 
experienced the shortest time to CR and a significantly decreased tumor burden as 
reflected by change of PML-RARoc transcripts at CR. All 20 cases in the combination 
group remained in CR, whereas 7 of 37 cases treated with mono-therapy relapsed after 
a follow-up of 8-30 mo (median: 18 mo) (Fig. 1). 

3.7. Other Arsenic Agents in Leukemia 

Lu et al. (48) used oral tetra-arsenic tetra-sulfide (As 4 S 4 ) to treat 129 APL patients, 
including 19 newly diagnosed, 7 first relapse, and 103 hematologic complete remission 
(HCR) patients. The results revealed that CR was achieved in all patients with newly 
diagnosed APL and with hematologic relapse. Like ATO, As 4 S 4 was well tolerated by 
APL patients with moderate side effects, including asymptomatic prolongation of cor- 
rected QT interval, transient elevation in liver enzyme levels, rash, and mild gastrointes- 
tinal discomfort, without any evidence of myelosuppression or appreciable long-term 
side effects. More interestingly, 14 out of 16 newly diagnosed patients, 5 of the 7 patients 
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Fig. 1. Kaplan-Meier disease-free survival curve. The successive cases of newly diagnosed 
acute promyelocytic leukemia patients were randomized into three groups for induction therapy. 
The relapsed rate of the all-trans retinoic acid-alone treatment group is significantly higher 
than that of combined group (p = 0.0202, Fisher's exact test). When patients of two 
monotherapy groups are put together, the relapse rate is also statistically higher than the com- 
bined group (p = 0.038) (cited from ref. 47). 



with hematologic relapse, and 35 of 44 patients in the HCR group had cytogenetic and 
molecular CR. Obviously, oral As 4 S 4 treatment is highly effective and safe enough in 
both remission induction and maintenance therapy for APL patients. 

4. In Vivo Action of ATO on APL 

The clinical effectiveness of ATO on APL has stimulated research aimed at under- 
standing its mechanism of action. Our initial clinical observations showed that during 
daily continuous intravenous infusion of ATO on APL, with the gradual reduction of 
leukemic promyelocytes, there were a large percentage of myelocyte-like cells carry- 
ing the PML-RARoc gene. This was revealed by fluorescence in situ hybridization 
(FISH) analysis, indicating that ATO could induce in vivo partial differentiation of 
APL cells (49). The presence of leukocytosis and even a retinoic acid syndrome-like 
syndrome during ATO treatment supported this suggestion (49). Moreover, during the 
ATO treatment, cells with condensed nuclei hallmarking the apoptosis process appeared 
in both BM and peripheral blood, which could be positively labeled by in situ terminal 
deoxynucleotidyl transferase (49). Similar observations were described in other clini- 
cal reports (23,24,32,33,35). With a deeper in vivo analysis, for example, Soignet et al. 
(23) reported that ATO treatment induced a progressive decrease in the proportion of 
cells expressing CD33 (an antigen typically associated with primitive myeloid cells) 
along with an increase in the proportion of cells expressing CD1 lb (an antigen associ- 
ated with mature myeloid elements). ATO also increased the number of cells that si- 
multaneously expressed both antigens and displayed a hybrid signal of PML and RARa 
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genes on FISH analysis. Meanwhile, serial Western blot analysis of BM mononuclear 
cells revealed the precursor forms of caspase 2 and caspase 3 were upregulated in 
response to ATO treatment. In addition, ATO-induced apoptosis and modest differ- 
entiation of APL cells in vivo could also be seen in an APL model using syngenic 
grafts of leukemic blasts from PML-RARoc transgenic mice (50). The same was observed 
in subcutaneous tumors that were formed by implantation of ATRA-resistant UF- 1 APL 
cells into hGM-CSF-producing transgenic SCID mice (51). 

5. In Vitro Action of ATO 
5. 1. Apoptosis Induction 

5.1.1. Apoptosis Induction in a Relatively Wide Spectrum of Tumors 

Since we reported that ATO induced the NB4 cell line (a commonly used APL cell 
line with t(15;17) and sensitized to ATRA-induced differentiation (52)) to undergo 
apoptosis with downregulation of the expression of anti-apoptotic gene Bcl-2 and 
modulation of PML-RARoc/PML proteins (21), the apoptosis-inducing and growth- 
inhibitory effects of ATO have been widely confirmed in many cancer cells. These 
include myeloid leukemia cell lines and/or fresh cells from CML, acute megakaryo- 
cyte leukemia, lymphoprolifterative diseases, multiple myeloma, human T-cell leuke- 
mia virus type I (HTLV-I)-infected T-cell lines and adult T-cell leukemia cells, as well 
as some solid tumor cells such as neuroblastoma cell lines, esophageal carcinoma cell 
lines, gastric cancer cells, ovarian cancer cells, and prostate cancer cells (for details, 
see ref. 53). 

5.1.2. Potential Mechanisms of Apoptosis Induction of ATO 
5.1 .2.1 . Mitochondria, Caspases, and ATO-Induced Apoptosis 

A large body of studies have shown that mitochondrial modulation, especially the 
mitochondrial transmembrane potentials (A^Pm) collapse, is a pivotal event in the 
apoptotic process. One of the important factors leading to A^Pm collapse is the opening 
of the mitochondrial permeability transition pore (MPT), which results in the release of 
pro-apoptotic factors such as cytochrome c and the apoptosis-inducing factor into the 
cytosol. These substances, in turn, activate some members of the caspase family (54). 
ATO has been shown to induce A^m disruption in APL cells, malignant lymphocytes, 
and myeloma cell lines, and in a cell-free system (55-57). Arsenite has also been shown 
to open the purified, reconstituted MPT in vitro in a cyclosporin A- and Bcl-2-inhib- 
itable fashion (58). Furthermore, ATO-induced apoptosis in cultured myeloid leuke- 
mia cells was carried out by caspases from the distal, PARP-cleaving part of the 
activation cascade; PKC activation had no effect on apoptosis induced by either ATO 
or VP-16 in these cells (59-61). In addition to caspase-3, caspase-8 and Bid were acti- 
vated by ATO in a GSH concentration-dependent manner in NB4. An inhibitor of 
caspase-8 blocked not only the activation of caspase-3 but also the loss of A^m (62). 
However, Liu et al. (63) suggested that caspase-8 and caspase-3 were primarily in- 
volved in ATO-induced apoptosis in myeloma cells with mutated p53, while primarily 
caspase-9 and caspase-3 were involved in cells expressing wild-type (wt) p53. It is 
worth noting that ATO did not increase the p53 level or activity in most cell types 
tested. ATO also did not induce p53-dependent transactivation, and there was no dif- 
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ference in apoptosis induction between cells with p53 +/+ or p53 _/ " (64). However, a 
marked increase in p53 protein levels was seen during apoptosis in a human gastric 
cancer cell line induced by ATO, while co-incubation with p53 antisense oligonucle- 
otide suppressed ATO-induced intracellular p53 overexpression and apoptosis failed 
(65). Taken together, depolarization of mitochondria and activation of caspases are 
common features involved in ATO-induced apoptosis (66). 
5.1 .2.2. Cellular Glutathione, Reactive Oxygen Species, and ATO-Induced Apoptosis 

It has long been known that the toxic effects of trivalent arsenicals are mediated 
through interaction with active tissue thiols. Although one report (67) showed that 
dithiothreitol (DTT) unexpectedly enhanced ATO-induced apoptosis in leukemic cells, 
most groups have observed that 0.2 mM DTT could block ATO-induced A^m col- 
lapse, caspase activation, and apoptosis in NB4 cells and malignant lymphocytes. On 
the other hand, buthionine sulfoximine (BSO), a depleter of cellular glutathione (GSH), 
substantially enhanced sensitivity. Simultaneous treatment with BSO and ATO could 
increase the sensitivity of HL60 and U937 cells to ATO. Furthermore, GSH depletion 
overcame resistance to ATO in arsenic-resistant cell lines (68). We and others (56,69- 
71) have found that intracellular GSH levels and/or the activity of enzymes including 
glutathione peroxidase (GPx), glutathiones-transferase n (GSTji), and catalase (which 
regulate cellular H 2 2 content) are important factors in determining cell sensitivity to 
ATO. According to these data, we proposed that the ATO-induced A^m collapse and 
apoptosis are associated with thiols of some important proteins, especially those con- 
stituting the MPT complex. Of note, ATO did not cause the oxidation of a critical 
cysteine residue (Cys 56) of purified adenine nucleotide translocator (72), an important 
vicinal SH-group-containing component-constituted MPT complex. 

The amount of cellular thiol groups is regulated by the redox system, while reactive 
oxygen species (ROS) are important factors to induce MPT opening. The generation of 
ROS in apoptosis induction by ATO, partly inhibiting mitochondrial respiratory func- 
tion (73), has been widely confirmed, although the apoptotic activity of ATO in pros- 
tate and ovarian cancer cells is not mediated by superoxide generation (74). Of note, 
ATO at clinically achievable therapeutic concentrations also induced apoptosis in 
peripheral blood T-cells by enhancing oxidative stress (75). Different compounds, such 
as ascorbic acid (69,76) and emodin (77), a natural anthraquinone derivative, sensi- 
tized cells to ATO via generation of ROS. More intriguingly, ATO-induced oxidative 
stress promoted telomere attrition, chromosome end-to-end fusions, and apoptotic cell 
death, which were effectively prevented by the antioxidant ^-acetylcysteine. Further- 
more, embryos with shortened telomeres from late generation telomerase-deficient 
mice exhibited increased susceptibility to arsenic-induced oxidative damage. Unex- 
pectedly, arsenite did not cause chromosome end-to-end fusions in telomerase RNA 
knockout mouse embryos, despite progressively damaged telomeres and disrupted 
embryo viability (78). The authors suggested that ATO could initiate oxidative stress 
and telomere erosion, leading to apoptosis and antitumor therapy on the one hand and 
chromosome instability and carcinogenesis on the other. In addition, the generation of 
ROS was associated with enhanced apoptosis induced by tetra-arsenic oxide (As 4 6 , 
2,4,6,8,9,10-Hexaoxa-l,3,5,7-tetraarsatricyclo [3.3.1.13,7] decane) in ATO-resistant 
U937 leukemic cells (79). 
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5.1 .2.3. c-Jun NH 2 -Terminal Kinases (JNKs) and ATO-Induced Apoptosis 

ATO activated JNK at a dose range similar to that for induction of apoptosis. Acti- 
vation was almost totally blocked by expression of a dominant-negative mutant of 
JNK1 in JB6 cells (64) and in ATO-sensitive but not in ATO-resistant NB4-derived 
subclones (80). The JNK inhibitor dicumarol significantly increased cell growth and 
prolonged survival in response to ATO. Inhibition of nuclear factor (NF)-kB kinase 
(IKK(3) gene knockout (Ikk(3~ /_ ) resulted in prolongation of ATO-induced JNK activa- 
tion, which was closely associated with oxidative stress response, as indicated by 
elevated expression of heme oxygenase- 1 and the accumulation of H 2 2 (81). Fur- 
thermore, activation of JNK in NKM-1, which was established from a patient with 
acute myeloid leukemia (M2), was sustained from 6 to 24 h after 1 umol/L ATO treat- 
ment, and preceded changes in cellular H 2 2 , A(])m, and caspase-3 activation (82). 
These data indicated an essential role of JNK signaling in the induction of growth 
inhibition and apoptosis induction by ATO. 

5.1 .2.4. Mitotic Arrest and Apoptosis Induction 

Apoptosis induction by ATO in normal hematopoietic progenitors and leukemic cells 
might be dependent on their cell-cycle status (83). According to several recent reports 
(84-86), ATO stimulated G 2 /M arrest in promonocytic leukemic cells U937 (85), and 
in other human tumor cell lines such as HeLa S3, which was inhibited by pre treatment 
with the DNA polymerase inhibitor aphidicolin and was related to tubulin polymeriza- 
tion without affecting GTP binding to (3-tubulin. Recently, we (87) showed that APL 
cells are arrested at early mitotic phase before the collapse of A(|)m and apoptosis after 
treatment with pharmacological concentrations of ATO. On the contrary, myeloma 
cells with wild-type (wt) p53 were relatively resistant to ATO, with maximal apoptosis 
of about 40% concomitant with partial arrest of cells in Gj and up-regulation of p21 
(63). These results lead to our proposal that mitotic arrest is one of the common mecha- 
nisms for ATO-induced apoptosis in cancer cells. ATO induced rapid and extensive 
(more than 90%) apoptosis in a time- and dose-dependent manner concomitant with 
arrest of cells in G 2 /M phase of the cell cycle. 

5.2. Induction of Differentiation 

As described above, plasma concentrations of ATO were relatively low (0.25-0.1 
uM) most of the time during remission induction with 0.16 mg/kg/d iv of ATO. Thus, 
we investigated effects of such low-dose ATO on APL cells. The results revealed that 
a relatively longer use of low-dose ATO (0.1-0.5 umol/L) induced partial differentia- 
tion. Furthermore, fresh cells from APL cases were resistant to 1.0 umol/L ATO- 
induced A^Fm collapse/apoptosis, whereas they were sensitive to 0.1 umol/L 
ATO-induced differentiation. The arsenic-resistant NB4 subclone (NB4-AsR) did not 
undergo apoptosis but maintained the partial differentiation response to this drug (88), 
concomitant with the finding on AP-1060, a newly established APL cell line from a 
multiple-relapse patient (89). Obviously, different pathways might be involved in 
differentiation and apoptosis as induced by ATO. However, the mechanism of ATO- 
induced differentiation remains elusive. Recently, the possibility of a modulation of 
histone acetylation (90) and potent inhibition of the interaction of SMRT (91), which 
participates in transcriptional repression by a diverse array of vertebrate transcription 
factors, with its transcription factor partners by ATO has also been proposed. 
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The in vitro differentiation-inducing ability of ATO did not appear to be better than 
its in vivo activity. Thus, it is reasonable to speculate that some factors in BM microen- 
vironments modulate the in vivo activity of ATO. These factors could be some 
cytokines and other molecules. Indeed, APL cells in vitro exposed to ATO at 1 umol/L 
showed a significant increase of IL-1J3 and G-CSF (P < 0.05) production but a signifi- 
cant decrease of IL-6 and IL-8 (92). More recently, Chelbi-alix et al. (93) suggested 
that IFN-oc or IFN-y combined with 0.1 umol/L ATO led to an increased maturation 
effect on NB4 cells and on two ATRA-resistant NB4-derived cell lines, NB4-R1 and 
NB4-R2. A strong synergy was discovered between 0.25 umol/L ATO and the cyclic 
adenosine monophosphate (cAMP) analog in the full induction of differentiation in 
NB4, NB4-R1, and fresh APL cells (94). 

Although leukemic cells do not form a well-circumscribed "mass" in BM like that in 
solid tumors, oxygen levels of BM in AML patients may be decreased as a result of 
rapid growth of leukemic cells. The lack of oxygen in BM is possibly further aggra- 
vated by the anemia that often accompanies newly diagnosed AML patients. Leukemic 
cells are cultured in vitro at ambient oxygen (21%) in most circumstances, while in 
vivo cells are physiologically exposed to much lower oxygen levels, ranging from 16% 
in pulmonary alveoli to less than 6% in most peripheral organs of the body. Based on 
these facts, we tested the effect of ATO on APL cells under hypoxia. Unexpectedly, we 
found that cobalt chloride (CoCl 2 )/ desferrioxamine (DFO)-mimicked hypoxia or mod- 
erate hypoxia (2% and 3% 2 ) triggered and enhanced ATO-induced leukemic cell 
differentiation to various extents in an AML subtype-independent manner ([95] and 
data not shown). 

5.3. ATO and PML/PML-RARa 

About 95% APL expresses PML-RARoc chimeric proteins, which can block cell 
differentiation and apoptosis through dominant negative inhibition of biologic func- 
tions of wild-type PML, RARoc, as well as other proteins (for details, see ref. 1). PML, 
a tumor suppressor involved in many complex functions through gene transcription 
regulation (e.g., growth arrest and apoptosis) is normally localized in a nuclear domain 
named the PML nuclear body (NB), PML oncogenic domain (POD), or nuclear domain 
10 (ND-10). In APL cells, due to the heterodimerization of PML-RARoc with wild-type 
PML, PML-NBs are disrupted into a micro-speckle nuclear pattern, resulting in the 
loss of functions of PML and/or NB. Modulation and/or degradation of the chimeric 
proteins and reorganization of NB components have been widely accepted as important 
molecular mechanisms based on ATRA-induced APL cell differentiation. Although 
arsenic exerts its effect on many cell types, APL seems to be the most striking clini- 
cal target for ATO. This encouraged exploration of possible effects of ATO on APL- 
specific oncogenic mechanisms, such as PML-RARoc fusion proteins. Interestingly, 
ATO can rapidly modulate the subcellular localization of PML and PML-RARoc and 
induce the degradation of these proteins in APL cells and cells expressing exogenous 
PML-RARoc (21,49,70,71,88,96-98). The latter are associated with the SUMO-1- 
polymodified forms of PML. 

Sternsdorf et al. (98) reported that ATO induces apoptosis only in U937 cells express- 
ing PML-RARoc. Of note, trivalent antimonial-induced apoptosis in NB4 cells accompa- 
nies degradation of PML-RARoc and the reorganization of PML NBs (99). Comparative 
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studies between cells expressing PML-RARa and PLZF-RARa proteins, the latter 
being produced by another APL-related chromosome translocation t(l 1 ; 17), showed 
that ATO can induce apoptosis on U937 and APL cells expressing the former, but not 
the latter, fusion protein(700,70i). It thus seems that ATO induces apoptosis of APL 
cells by targeting PML-RARa. However, the wide spectrum of ATO-induced 
apoptosis, as well as the degradation of PML/PML-RARoc in a subline of NB4 cells 
(NB4/As) resistant to ATO-induced apoptosis (62), show that PML/RARoc degrada- 
tion is not the only mediator for arsenic sensitivity. 

Puccetti et al. (102) reported that sensitivity to ATO-induced apoptosis in U937 
cells can be increased either by overexpression of PML or by conditional expression of 
activated RAS, which upregulates PML. Moreover, ATO treatment induced phospho- 
rylation of the PML protein through a mitogen-activated protein (MAP) kinase path- 
way. Increased PML phosphorylation is associated with increased sumoylation of PML 
and increased PML-mediated apoptosis. Conversely, MAP kinase cascade inhibitors 
or the introduction of phosphorylation- or sumoylation-defective mutations of PML 
impair apoptosis, indicating that phosphorylation by MAP kinase cascades potentiates 
the antiproliferative functions of PML and helps mediate the proapoptotic effects of 
ATO (103). 

Interestingly, among those nonresponder-to-ATO patients described in the SIH clini- 
cal trial series (27,29), two relapsed cases, one from a standard-dose group and the 
other from a low-dose group, were PML-RARa negative at the time of treatment in 
spite of the fact that fusion gene transcripts were positive at initial disease presentation. 
Another patient from the standard-dose group developed a new malignant clone with 
the AML1-ETO fusion gene, a hallmark of AML-M2, in addition to PML-RARa ( 34). 
Therefore, the in vivo sensitivity to ATO appears to require the expression of PML- 
RARa. 

5.4. Inhibition of Angiogenesis 

Angiogenesis plays a critical role in the growth of solid tumors and may be impor- 
tant for the expansion of leukemic cell populations. An interesting finding by Roboz et 
al. (104) was that treatment of proliferating layers of human umbilical vein endothelial 
cells (HUVECs) with ATO resulted in a reproducible dose- and time-dependent alter- 
ation including activation of endothelial cells, upregulation of endothelial cell adhe- 
sion molecules, apoptosis of endothelial cells, and inhibition of vascular endothelial 
growth factor production. Incubation of HUVECs with ATO prevented capillary tubule 
and branch formation, as shown by in vivo and in vitro endothelial cell differentiation 
assays. In experimental solid tumors, a single administration of ATO produced prefer- 
ential vascular shutdown in the tumor tissue with resultant hemorrhagic necrosis (105). 
This phenomenon was repeatable, and no apparent toxic effects were observed on nor- 
mal skin, muscle, or kidneys of the experimental animals. 

5.5. Effects of Metabolites of ATO 

Biomethylation is the major metabolic pathway for inorganic arsenic (iAs) in humans 
and in many animal species, within which iAs undergoes metabolic conversion that 
includes reduction of iAs v to iAs 111 , with subsequent methylation yielding mono 
(MM)- and dimethylated (DM) metabolites. The postulated scheme is as follows: 
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iAs v ^iAs m ->MAs v ^MAs m ->DMAs v ^DMAs m ^MAs m and DMAs 111 . NB4 cells did not 
methylate iAs m , but mono- and dimethylated metabolites formed from iAs m in HepG2 
cells were released into the medium (106). Indeed, MMA m , DMA 111 , MMA V , and 
DMA V could be detected in urine samples collected consecutively for 48 h in four 
tested cases of APL patients receiving ATO treatment at standard dose; As 111 , MMA V , 
and DMA V accounted for >95% of the total arsenic excreted (107). Therefore, it is 
possible that methylated ATO metabolites that form in vivo may contribute to the thera- 
peutic effect of ATO in APL. Ochi et al. (108) concluded that DMA 111 was the most 
potent in terms of the ability to cause apoptosis in HL-60 cells; arsenite was less potent. 
Our in vitro studies (706) indicated that MMA m , and to a lesser extent DMA 111 , were 
more potent growth inhibitors and apoptotic inducers than iAs m in leukemia and lym- 
phoma cells, but not in human BM progenitor cells. H 2 2 accumulation and GPx inhi- 
bition, but not degradation of PML-RARoc, correlated with greater MAs m O-induced 
apoptosis of NB4 cells. 

6. Clinical Trials of ATO in Other Cancers 

Preclinical investigations as described above indicated that the biological targets of 
ATO, including induction of apoptosis, nonterminal differentiation, and suppression of 
proliferation and angiogenesis, extend to a variety of malignancies other than APL. 
Thus, many clinical physicians and scientists have tried to enumerate the potential 
benefits of ATO in cancers other than APL (109). The National Cancer Institute is 
working cooperatively with research centers across the United States to evaluate clini- 
cal activity of ATO in hematologic malignancies such as AML, acute lymphocytic 
leukemia, CML, non-Hodgkin's lymphoma, Hodgkin's disease, chronic lymphocytic 
leukemia, myelodysplastic syndrome, and multiple myeloma. Similar research was also 
carried out on solid tumors, such as advanced hormone-refractory prostate cancer and 
renal cell cancer, and in cervical cancer and refractory transitional cell carcinoma of 
the bladder. The safety and pharmacokinetics of ATO are being evaluated in pediatric 
patients with refractory leukemia and lymphoma. The results of these ongoing studies 
should provide important insights into the clinical utility of ATO in these diseases 
(110,111). We now summarize results of a clinical trial on adult T-cell leukemia/lym- 
phoma (ATL). ATL is a severe chemotherapy-resistant malignancy associated with 
prolonged infection by the human T-cell-lymphotropic virus 1 (HTLV-1), a retrovirus. 
Substantial data revealed that ATO combined with IFN-oc could induce cell-cycle ar- 
rest and apoptosis of ATL cells both ex vivo and in vitro (112-114). Moreover, ATO 
was shown to rapidly and selectively block the transcription of NF-KB-dependent genes 
in HTLV- 1 -infected cells only via dramatic stabilization of IkB-oc and IkB-(3 (115- 
116). Inspired by these in vitro observations, Hermine (31) initiated a phase II trial of 
an ATO/IFN combination in seven patients with relapsed/refractory ATL (four acute 
and three lymphoma). Four patients exhibited a clear initial response (one complete 
remission and three partial remissions). Yet, the treatment was discontinued after a 
median of 22 d because of toxicity (three patients) or subsequent progression (four 
patients). Six patients eventually died from progressive disease (five patients) or infec- 
tion (one patient), but the remaining patient is still alive and disease-free at 32 mo. In 
conclusion, arsenic/IFN treatment is feasible and exhibits an antileukemic effect in 
very poor prognostic ATL patients despite significant toxicity. 
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Finally, 14 patients with metastatic renal cell carcinoma (RCC) with bidimensionally 
measurable disease, a Karnofsky performance status of at least 70%, life expectancy of 
greater than 3 mo, and no evidence of brain metastases, were treated in a phase II trial 
with ATO given intravenously at a dose of 0.3 mg/kg/d for five consecutive days every 
4 wks. The most common forms of toxicity observed were grade II elevation in liver 
function tests (36%), anemia (21%), renal insufficiency (14%), rash (7%), and diarrhea 
(7%). Best response was stable disease in three patients, with one patient remaining in 
the study at 8+ mo. At the dose and schedule used in this trial, ATO did not achieve a 
complete or partial response in metastatic renal cell carcinoma (117). 

7. Conclusions 

Although treatment with ATRA has greatly improved patient survival, 30% or so of 
patients still relapse following combination therapy with ATRA and cytotoxic chemo- 
therapy. From recent studies that have demonstrated the efficacy and safety of ATO in 
treating APL patients, ATO is now considered the standard of care in relapsed/refrac- 
tory APL, which has excited Western and Chinese oncologists (22). Unlike patients 
treated with ATRA alone, more intriguingly, patients treated with ATO alone have a 
high rate of molecular remission, i.e., molecular conversion to PML-RARa negativity. 
On the other hand, the multiple mechanisms by which ATO can induce cell differentia- 
tion, apoptosis, and anti-angiogenesis suggest its potential for clinical application alone 
or combined with other agents to manage a variety of malignancies. The experience 
with ATO leads us to predict that a merger between TCM and modern molecular biol- 
ogy will bring benefits for cancer therapy. It also emphasizes the importance and feasi- 
bility of natural products in discovery of drugs against ailments including cancer. 

There are many questions remaining to be addressed in the future, which will help to 
uncover how ATO functions within the cell, to explore new drugs and new clinical 
applications of ATO. For example, does ATO achieve its clinical effectiveness by 
apoptosis and/or differentiation induction, and does PML-RARa involve clinical action 
of ATO in APL patients? Answering such questions will provide important clues for its 
possible clinical use in leukemia and other cancers and improve its application. If it is 
true that differentiation induction and PML-RARa degradation are critical for the in 
vivo effects of ATO in APL, whereas apoptosis induction is relevant to its toxicity, we 
can speculate on its ineffectiveness in cancers beyond APL. Second, it is very impor- 
tant to determine by which mechanism(s) ATO induces APL cells to differentiate. The 
answer will not only advance our understanding of mechanisms of leukemic cell differ- 
entiation, but will also promote identification of differentiation-related drug target(s), 
and could lead to some new differentiation-inducing drug(s). Third, many drugs are 
synthesized to achieve better efficacy and safety profiles according to the chemical 
structure of natural products. For example, development of Lipitor and Atorvastatin 
were clearly inspired by natural-product HMG-CoA reductase inhibitors such as 
lovastatin and simvastatin. In this sense, it is very meaningful and significant to syn- 
thesize new compounds similar to ATO or to chemically modify ATO so as to reduce 
its toxicity and to keep or even enhance its clinical effectiveness. Based on the in vivo 
synergistic effects of ATO and ATRA, it has been proposed that ATO be integrated 
into the backbone of ATRA (47). Finally, the long-term effect of ATO in treatment of 
APL remains to be illustrated, in spite of higher molecular remission. It is also essential 
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to investigate new combinations of arsenic agents with other chemotherapy agents and 
ATRA in large clinic trials. 
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Summary 

Natural-product-derived drugs are a major portion of the total number of approved drugs in the 
antibacterial area. The majority of bacteria and fungi in the environment is only known by molecular 
fingerprints and has resisted cultivation. Therefore, new methods have been developed to access this 
tremendous microbial diversity for the discovery of novel small molecules. These culture-dependent 
and -independent methods include a novel high-throughput cultivation technology as well as a recom- 
binant approach to discover and express novel natural products. 

Key Words: Microbial diversity; high-throughput cultivation; recombinant approach; natural 
products; genomic libraries; high-throughput screening; cultivation-dependent and -independent 
methods. 

1. Introduction 

Natural products with high therapeutic potential derived from bacteria and fungi 
have played a tremendous role in drug discovery since the discovery of the first antibi- 
otic — penicillin — by Alexander Fleming in 1929 (1). Since then, almost 20,000 metabo- 
lites of microbial origin have been described (2). Around 80% of all microbial-derived 
secondary metabolites are produced by members of the order Actinomycetales (phylum 
Actinobacteria) with the genus Streptomyces accounting for approx 50% (3-5). The 
remaining 20% of secondary metabolites are produced by members of the phyla 
Bacteroidetes (e.g., Chryseobacterium), Cyanobacteria (e.g.,Nostoc), Firmicutes (e.g., 
Bacillus), and Proteobacteria (e.g., Myxococcus) (6-9). Despite attempts to isolate and 
culture new microorganisms by traditional cultivation methods, the discovery rate of 
new structural classes of antimicrobial molecules has declined (10,11). Isolated culture 
extracts have yielded numerous previously described metabolites (12), and the rate of 
rediscovery of known antibiotics is approaching 99.9% (13). In addition, pharmaceuti- 
cal companies shifted from traditional natural-product discovery to more combinato- 
rial chemistry-based discovery. The generation of large chemical-compound libraries 
pushed the development of high-throughput screens based on molecular targets. Com- 
binatorial chemistry, and more recently high-throughput parallel syntheses, has been 
implemented to generate these diverse chemical libraries. Consequently, many phar- 
maceutical companies have de-emphasized research on natural products in favor of 
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Table 1 

Number of Approved Antibacterial Drugs (15) 

Year Number of antibacterial drugs Natural product-derived Totally synthetic 

17 7 

6 5 

2 



mass-produced combinatorial libraries. However, the expected surge in productivity 
has not materialized, and the number of new active substances introduced into the mar- 
ket by the pharmaceutical industry has hit a 20-yr low of 37 substances in 2001, and is 
still declining (14). The US Food and Drug administration (FDA) received 16 new 
drug applications in 2001, down from 24 the previous year (14). The number of anti- 
bacterial approved drugs is declining rapidly (Table 1). 

Natural-product-derived drugs are still a major portion of the total number of approved 
drugs in this therapeutic area (Table 1). The generation of truly novel diversity from 
natural-product sources, combined with total and combinatorial synthetic methodolo- 
gies, including the manipulation of biosynthetic pathways, provides the best solution 
to the current productivity crisis faced by the scientific community engaged in drug 
discovery and development (14-16). Although almost 20,000 microbial metabolites 
and approximately 100,000 plant products have been described so far, secondary 
metabolites still appear to be an inexhaustible source of lead structures for new anti- 
bacterial, antiviral, antitumor, and agricultural agents (2). But how can we generate 
novel diversity from natural-product sources? Are there still novel organisms to be 
discovered and isolated that harbor novel chemical structures, and how can we access 
such novel microorganisms? 

When assessing microbial diversity, Torsvik and co-workers estimated that a pas- 
ture soil sample contained about 3500 to 8800 genome equivalents (17). This could 
result in around 10,000 different species of equivalent abundances (18). Other esti- 
mates of species diversity within a single soil sample vary from 467 species (19) up to 
500,000 species (20). It is estimated that only 1 % or less of the bacteria that are known 
to exist in the soil environment have been cultured. Therefore, the soil environment 
harbors a vast number of uncharacterized microorganisms (18,21-27). 

In addition, studies of different environments such as open sea water and marine 
sediment, and even extreme environments such as hot springs, demonstrate that the 
diversity of microorganisms is far larger than ever anticipated (28-30). Analysis of 
microbial diversity within the past decades has resulted in a tremendous increase of 
new phylotypes. When Carl Woese and colleagues defined the major bacterial phyla 
almost 20 yr ago (31), they recognized 11 phyla. After two decades of environmental 
16S rRNA gene sequencing from almost every habitat on Earth, 41 additional phyla 
have been described (32), bringing the number to a total of 53 bacterial phyla (the 
former Gram-positive bacteria are now two separate phyla, the Firmicutes and 
Actinobacteria). Only 27 out of 53 bacterial phyla contain previously cultivated micro- 
organisms (Fig. 1A). Half of the bacterial phyla recognized to date consist entirely of 
so far uncultured bacteria, and have been described solely by their 16S rRNA gene 
sequences (32,33). In addition, only a small fraction of organisms that can be cultured 
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Fig. 1. Reconstructed phylogenetic tree of the domain Bacteria based on 16S rRNA gene se- 
quences (A) (27). Bacterial phyla with cultivated representatives are shown in gray and black. Num- 
bers in bold represent valid published species within a phylum (35). Shown in black are phyla with 
representatives that produce anti-infective molecules (based on the Dictionary of Natural Products, 
Version January 2003 [36]) (A). Relative distribution of all bacterial phyla with cultivated represen- 
tatives (B). "Other" refers to all phyla shown in gray in A. Modified with permission from Nature 
Reviews Microbiology from ref. 27, copyright 2004 Macmillan Magazines Ltd. 
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have been at least partially characterized (34). Surprisingly, so far only five phyla in- 
clude isolates producing bioactive molecules (Fig. 1A). These five phyla (Actinobac- 
teria, Bacteroidetes, Cyanobacteria, Firmicutes, and Proteobacteria) on the other hand 
represent 95% of all cultivated and validly published species (Fig. IB) (27). The rest of 
the cultivated phyla (22 phyla) represent only 5% of all species (27). Within these 
cultivated phyla, certain bacterial groups are represented to a greater extent than oth- 
ers. For example, representatives of the order Actinomycetales (phylum Actinobacteria) 
account for 27% of all available and recognized bacterial species, and members of the 
genus Streptomyces (order Actinomycetales) represent 10% of published bacterial spe- 
cies (35). On the other hand, many phyla are represented by only very few isolates; 
some contain only one described species. This untapped microbial diversity represents 
a tremendous potential for discovery of novel bioactive small molecules. But how can 
we get access to this incredible microbial biodiversity? 

In order to develop novel drugs derived from natural products, three key issues must 
be addressed. 

1. Access to truly novel diversity. 

2. Faster chemical dereplication methods to determine whether the active compound is novel 
or known. 

3. New methods to address production and chemical modification of the novel chemical 
entities. 

Recently, advances in microbiology, molecular biology, and analytical chemistry 
have started to address many of these issues. Within this chapter we shall discuss new 
methods to access microbial biodiversity with culture-dependent and independent 
methods. 

2. Access to Microbial Diversity Through New Cultivation Approaches 

Environmental microbiologists are facing a major challenge today. In order to gain a 
comprehensive understanding of microbial physiology or to fully access metabolic 
pathways containing genes dispersed throughout the genome (37,38), cultivation of 
microorganisms will be required. The enrichment and cultivation techniques devel- 
oped by Pasteur, Koch, Beijerinck, and Winogradsky (39^t2) facilitated new discover- 
ies in microbiology for over 100 yr. Such conventional cultivation of microorganisms, 
however, is selective and biased towards growth of specific microorganisms (43,44). 
The majority of cells obtained from the environment and visualized by microscopy are 
viable, but they do not generally form visible colonies on plates. Visible colonies on 
plate count media require at least 10 5 cells to be identified by the naked eye, and tradi- 
tional media select strongly for microbes that grow rapidly to high density, are resistant 
to high concentrations of nutrients, and are able to grow in isolation. 

It could be argued that these traditional culturing strategies use conditions that are 
completely different from the normal growth habit of many microbes, and are a major 
contributing factor to the failure to cultivate most microorganisms in pure culture 
(18,21,44). It has often been claimed that plate count methodologies are not suitable 
for cultivation of bacteria and that the members of groups without cultivated represen- 
tatives are somehow "nonculturable" (45,46). Janssen and co-workers addressed the 
issue of high concentration of nutrients in cultivation media and investigated the 
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culturability of soil microorganisms using diluted nutrient broth, assuming that many 
of the so far uncultivated microorganisms are inhibited by high concentrations of nutri- 
ents (47). Extended incubation times for at least 10 wk were required to allow maxi- 
mum colony development. It was demonstrated that at least some of the bacteria 
detected by molecular ecological methods were not unculturable and that their isolation 
in pure culture did not require elaborate or expensive cultivation strategies (47-50). 

Other studies dealt with the effect of signal compounds on the cultivation efficiency 
of aquatic bacteria (51-55). Pure cultures of bacteria, typical for these aquatic environ- 
ments, could be recovered from natural planktonic assemblages. Bruns et al. (54,55) 
demonstrated that addition of c AMP to complex medium effectively increased the culti- 
vation success, and up to 10% of the total bacterial counts determined by epifluorescence 
microscopy after staining with 4',6-diamidino-2-phenylindole (DAPI) could be culti- 
vated after the addition of cAMP. 

Button et al. developed a dilution cultivation method for growing bacteria from 
oligotrophic aquatic environments (56). Populations of marine microorganisms were 
measured, diluted to a small and known number of cells, inoculated into plain sterilized 
seawater, and examined for the presence of 10 4 or more cells per mL over a 9-wk time 
interval. Doubling times observed were in the range of 1 d to 1 wk. Button speculated 
based on his experiments that most marine bacteria (60%) are viable and that low esti- 
mates of viability by traditional techniques resulted from the fact that most marine 
bacteria reach stationary phase before attaining visible turbidity (56). 

Others also reasoned that previously uncultivable microorganisms might grow in 
pure culture if provided with the chemical components of their natural environment 
(57). To allow access to these components, they placed marine microorganisms in dif- 
fusion chambers and incubated the chambers in an aquarium that simulated the natural 
setting for these organisms. The membranes allowed exchange of chemicals between 
the chamber and the environment but restricted movement of cells (57). Although the 
isolated cultures did not represent new phylotypes, large numbers of colonies of vary- 
ing morphologies were obtained after 1 wk of incubation in the chambers. 

In addition to cultivating novel microorganisms, the speed of isolation can be a sig- 
nificant factor for the discovery of novel small molecules. Development of automated 
isolation methods that generate thousands of different microorganisms (58) would be a 
significant advance in the discovery process. Connon and Giovannoni described the 
development of a high-throughput cultivation method enabling the identification of a 
large number of extinction cultures (59). Over a time course of 3 yr, many new micro- 
bial strains were isolated, including members of previously uncultured groups that are 
abundant in seawater (60). The use of microtiter dishes in combination with an auto- 
mated cell array and imaging process enabled them to increase sensitivity for the detec- 
tion of cells and therefore shorten incubation times for cells with low growth rates 
relative to those in previous studies that employed the concept of extinction culturing 
in natural media (56). The percentage of cells that could be cultured by this approach was 
several orders of magnitude higher than that obtained by cultivation on agar plates (59). 
A universally applicable method that allows access to the immense reservoir of 
untapped microbial diversity is an automated and extremely high-throughput cultiva- 
tion approach developed by Zengler et al. (58). This technique combines encapsulation 
of single cells in microcapsules for massively parallel microbial cultivation under low 
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Fig. 2. Discrimination among (A) free-living cells, (B) singly occupied or empty 
microcapsules, and (C) microcapsules containing microcolonies was accomplished by flow 
cytometry in forward and side light-scatter mode. 



nutrient flux conditions, followed by flow cytometry to detect microcapsules contain- 
ing microcolonies (Fig. 2). 

The microbial community, represented in single encapsulated cells, is grown in col- 
umns under very low nutrient flux conditions using mineral medium supplemented 
with low concentrations of nutrients extracted from the sampling site (Fig. 3). Over 
time, each cell capable of growth under the conditions in the column forms a 
microcolony within its enclosure (Figs. 3, 4). The high-throughput nature of this tech- 
nology is based on the use of high-speed flow cytometry and cell sorting to identify 
microcapsules containing microcolonies of >20 cells (Fig. 2), and then sort each posi- 
tive microcapsule into a well of a microtiter plate for further cultivation and analysis. 
The change in forward and side scatter is proportional to the colony size within the 
microcapsule. This allows the separation of microcapsules containing microcolonies 
from those without growth (Fig. 2). 

Using this technology, it was demonstrated that novel, previously uncultured organ- 
isms can form microcolonies within the microcapsules and can generate a pure culture 
once separated (58). During further incubation, microorganisms are able to divide and 
outgrow the microcapsules. The cultivation of microorganisms within the microcap- 
sules allows the reconstituted microbial community to be simultaneously cultivated 
"together" and "apart" because each "caged microcolony" can later be separated and 
analyzed. The ability to reconstitute the community in the column of microcapsules 
allows for diffusive cross-feeding of metabolites and other molecules (e.g., regulatory 
and signaling molecules) between members of the community. This feature also simu- 
lates the natural environment, and thus preserves some of the community interactions 
and other specific requirements that may be needed for successful cultivation. 

In addition, microbes are grown in an open flow system that simulates natural envi- 
ronments (Fig. 3), where microbes are exposed continually to a low concentration of 
nutrients. This is in contrast to a closed batch system where microbes receive a high 
concentration of nutrients at one point in time and metabolic byproducts can build up 
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to unnaturally high and inhibitory concentrations. The low concentration of nutrients 
also minimizes overgrowth by fast-growing organisms, thereby allowing propagation 
of microorganisms with extremely slow growth rates and/or that only grow to low cell 
densities. The high-throughput production, screening, and sorting of microcapsules is 
automated, and easily and economically scaleable. 

This high-throughput cultivation technology (58) is capable of isolating massive 
numbers of cultures and is therefore sufficient to supply increased diversity for modern 
high-throughput screening systems in drug discovery. High-throughput cultivation can 
provide tens of thousands of bacterial and fungal isolates per environmental sample. 
However, this increase in throughput requires advanced methods to characterize unique 
strains rapidly. To deal with this tremendous number of strains, a fully automated high- 
throughput method to determine the uniqueness of the bacterial and fungal isolates has 
been developed. Using Fourier transform infrared spectroscopy (FT-IR) (61,62) in com- 
bination with novel spectra-comparison algorithms, microtiter plate cultures can be 
analyzed and unique strains grown at larger scale (for example, 4-mL cultures) for 
organic extract preparation (Diversa Corporation, unpublished data). 

To prove that this technology is suitable to quickly isolate cultures with biological 
activity, 10,000 unique bacterial and 1500 fungal strains were isolated during a 4-mo 
period, and the organic extracts prepared were tested against a panel of Gram-positive 
and Gram-negative bacteria. Several hundred cultures with chemically relevant anti- 
bacterial activities have been identified (Diversa Corporation, unpublished data) (Fig. 5). 

Most of the recently described cultivation technologies try to simulate and mimic 
the natural environment. This has led to the use of extremely dilute culture media sup- 
porting only minimal cell growth, resulting in prolonged incubation times. Accord- 
ingly, technology has to be developed to work with minute amounts of cells, for 
example microcolonies on plates or within microcapsules. Some of the newly isolated 
microorganisms will not reach high cell densities, such as routine laboratory strains of 
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Bacillus or Escherichia when transferred into organic rich medium. Many of these new 
isolates can be a challenge for culture collections and will therefore require new meth- 
ods for preservation, storage, and handling. In addition, these cultures will require the 
further refinement of molecular and analytical tools in order to study them. For natural- 
product discovery, the scale-up of some of these cultures for large-scale production 
could also be a potential issue. New analytical tools such as liquid chromatography 
(LC)-tandem mass spectrometry (MS-MS) or micro-coil nuclear magnetic resonance 
(NMR) allow structure elucidation of interesting molecules from very small amounts 
of material (Diversa Corporation, unpublished data). However, it can still be a chal- 
lenge to isolate enough material for further biological studies or medicinal chemistry. 
A promising approach to overcome these issues lies within the recombinant natural 
products (RNP) platform. 

3. Recombinant Approaches to Discover or Express Novel Natural 
Products 

Recent advances in molecular biology led to the development of a recombinant ap- 
proach for natural-product discovery (24,63-66). The genetic information for the bio- 
synthesis of natural small molecules is encoded in compact gene clusters that can be 
cloned from the producing organism and expressed in a heterologous tractable host. 
This approach aims to access pathways of so far uncultured and intractable microor- 
ganisms. Genomic DNA is extracted directly from an environmental sample without 
the need to grow the associated microbes. Alternatively, genomic DNA is isolated from 
cultures, and cloned into suitable vectors. The libraries thus made are introduced into 
an engineered surrogate host for expression of small-molecule pathways (63,64,67-72). 
The pathways typically contain biosynthetic genes, resistance genes involved in self- 
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Table 2 

Illustrative Examples of Inter-Species and Inter-Genus Heterologous Expression 

of Natural-Product Pathways 









Pathway, 




Donor organism 


Vector system 


Heterologous host 


natural product 


References 


Streptomyces 


Plasmid, cosmid 


Streptomyces lividans, 
Streptomyces parvulus, 
Streptomyces coelicolor 


Polyketides, 
nonpolyketides 


80,83-86 


Rare 










Actinomycetes, 


Plasmid, cosmid 


Streptomyces albus 


Rebeccamycin 


87 


e.g Saccharothrix 










aerocolonigenes " 










Myxobacteria, 


Plasmid 


Streptomyces coelicolor 


Epothilone 


88 


e.g., Sorangium 










cellulosum 


Integration into 
chromosome 


Myxococcus xanthus 


Epothilone 


89 


Bacillus licheniformis 


Integration into 
chromosome 


Bacillus subtilis 


Bacitracin 


90 


Lysobacter lactamgenus 


Plasmid 


Pseudomonas putida 


Cephalosporin 


91 


Pseudomonas fluorescens 


Plasmid, cosmid 


Pseudomonas putida 


Pseudomonine 


92 


Serratia sp. 


Cosmid 


Erwinia carotovora 


Prodigiosin 


93 


Soil DNA library 


Cosmid 


Escherichia coli 


Violacein 


94 


Soil DNA library 


Cosmid 


Escherichia coli 


long chain Af-acyl- 
tyrosine 


95,96 


Soil DNA library 


BAC 


Escherichia coli 


Indigo blue, 
indirubin 


71 


Soil DNA library 


BAC 


Escherichia coli 


Turbomycin 


97 



protection, regulatory genes, and genes for the export of the product, all tightly linked 
on the chromosome (73,74). The genes within these pathways are either expressed 
individually or expressed coordinately, along with the adjacent genes, forming oper- 
ons. Pathways vary in size from as small as 5 to 8 kb (phosphomycin [75] and 
pyrrolnitrin [76]), to over 100 kb (candicidin [77] and rifamycin [78]). The majority of 
the pathways range from 20 to 200 kb in size (79). The cloning of the pigmented anti- 
biotic pathway for actinorhodin from Streptomyces coelicolor and its expression in 
Streptomyces parvulus was the first example of heterologous expression of a pathway 
in a representative of the Actinomycetales (80). Since then, many examples of heter- 
ologous expression of small-molecule pathways in Actinomycetales as well as other 
Gram-positive and Gram-negative bacteria have been described (Table 2). 

So far, several genomes from different members of the Actinomycetales have been 
sequenced (79) — for example, Thermonospora fusca (Integrated Genomics), Sacchar- 
opolyspora erythraea (Biotica), Saccharopolyspora spinosa (Dow AgroSciences), 
Streptomyces avermitilis (81), Streptomyces coelicolor (82), and Streptomyces 
diversa™ (Diversa Corporation). The published genome sequences of two Streptomy- 
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ces species — S. avermitilis (81) and S. coelicolor (82) — revealed that Streptomyces 
contains many more as yet unexplored secondary metabolic pathways than previously 
thought. The genomes of S. avermitilis and S. coelicolor encode more than 20 known 
and putative secondary metabolic gene clusters each. These apparently cryptic path- 
ways consist of type I, type II, and type III polyketide synthases (PKS), nonribosomal 
peptide synthetase (NRPS), as well as hybrid PKS-NRPS pathways (81,82). In addi- 
tion, genomic partial DNA sequencing suggested that several actinomycete genera have 
the capability to produce many as yet undiscovered secondary metabolites (98). Out of 
50 strains examined, 8 strains contained core sequences for the biosynthesis of the 
enediyne anticancer compounds (98). These compounds had not been detected in these 
strains before. Furthermore, 15% of all examined actinomycetes revealed sequences 
for the enediyne pathway (98). These data suggest that microorganisms such as mem- 
bers of the Actinomycetales may contain many more novel natural products than previ- 
ously observed. Therefore new technologies have been developed to access these 
natural products (63). 

4. Genomic Libraries From Single Isolates 

Natural-product pathways have been cloned and characterized using plasmids, 
cosmids, and fosmids, with a general capacity to hold about 40 to 50 kb of DNA. To 
clone larger DNA fragments, vector systems such as PAC and BAC vectors were de- 
veloped. The PAC vectors are based on the Escherichia co//-transducing phage PI, and 
can carry inserts >100 kb (99,100). The BAC vector was initially developed to con- 
struct human genomic libraries (101). It is based on the E. coli F'-plasmid and can 
accommodate inserts of up to 300 kb. These vectors can be shuttled between E. coli 
and the host of interest by adding appropriate sequences for replication and mainte- 
nance. Several single genome fosmid and BAC libraries have been constructed and 
published for bacterial isolates such as Aquifex (102), Mycobacterium (103), Entero- 
coccus (104), Bacillus (68), Corynebacterium (105), and Streptomyces (100,106). The 
average insert size varied between 30 kb and 70 kb. Inserts larger than 130 kb for 
Actinomycetales libraries (106) and a Bacillus library (68) have also been reported. 

5. Genomic Libraries From the Environment 

The construction of DNA libraries from the environment represents a bigger chal- 
lenge than making libraries from single isolates. One of the reasons is the presence of 
different types of contaminants within the isolated DNA, which can inhibit effective 
cloning. Many of these contaminants such as humic acids can be difficult to eliminate 
without shearing of the DNA. Several methods for preparing DNA suitable for cloning 
recently have been published (107-111). Environmental small DNA insert expression 
libraries (5 kb or less) were pioneered by Short and colleagues from different environ- 
ments to study microbial complexity and to screen for bioactive molecules such as 
enzymes and small pathways (23,24,26,63,65,112,113). However, there have been only 
a few published examples of large (64,114,115) insert libraries (40 kb and above) con- 
structed from environmental samples. Recently, soil DNA libraries have been con- 
structed in either a cosmid vector (776) or BAC vector (68,71) with an average insert 
size ranging between 27 and 45 kb, although inserts as large as 120 kb have been 
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captured in the BAC libraries (71). These libraries contained genes from previously 
unidentified phylotypes, as revealed by 16S rRNA gene sequencing (Diversa Corpora- 
tion, unpublished data; [68,71,117]). The size of the library (number of clones) to cover 
the microbial diversity in an environmental sample is dependent on the number of spe- 
cies present in the sample, the size of their genomes, the size of the pathway, and the 
insert size. To identify clones carrying intact pathways assuming a random distribution 
of genomic fragments in the library, several hundreds to thousands of clones per ge- 
nome need to be screened. 

6. Screening for Novel Natural-Product Pathways 

Environmental libraries can be screened for novel natural products by three general 
methods — a sequence-based approach, a hybridization-based approach, and an expres- 
sion-based approach. 

Sequence-based approach. In the early 1990s, Diversa Corporation pioneered the 
use of environmental clone libraries for the sequence-based discovery of novel enzymes 
and small-molecule pathways (118). In this sequence-based approach, the libraries can 
be screened for novel pathways such as those encoding type I and type II polyketide 
synthases (PKS) as well as nonribosomal peptide synthetase (NRPS). The compounds 
in these families possess diverse chemical structures with a variety of biological activi- 
ties, but are conserved in their core biosynthetic gene sequences (119-121). Similarly, 
there are biosynthetic genes involved in the modifications of natural products, e.g., 
glycosylation, which are highly conserved (122). These conserved secondary meta- 
bolic sequences have been used successfully as probes to isolate corresponding genes 
and the associated gene clusters from single-isolate genomic libraries (123,124). This 
principle has been extended to environmental genomic libraries. For example, using 
conserved sequences within type II PKS genes as polymerase chain reaction (PCR) 
primers, novel type II PKS genes were amplified from soil genomic DNA (125). Such 
amplified PKS genes can be used to identify clones carrying the entire pathway in 
environmental libraries. Conserved type I PKS primers were used to identify and iso- 
late several novel type I PKS pathways (116). One of these pathways derived from a 
soil genomic library contained PKS/NRPS hybrid genes. Introduction of the PKS-con- 
taining cosmids into S. lividans resulted in the expression of novel fatty dienic alcohol 
isomers. 

Hybridizaton-based approach. Alternatively, identification of novel natural-prod- 
uct pathways is achieved through a high-throughput genome-hybridization method to 
discover metabolic loci independent of their expression (98). Genes that are likely to 
be involved in the biosynthesis of natural products are identified by sequence compari- 
sons to a database of microbial gene clusters known to be involved in natural-product 
biosynthesis. This information is used to design screening probes to identify cloned 
subgenomic fragments (for example, cosmids or BACs) containing the gene(s) of 
interest as well as neighboring genes that together may constitute a biosynthetic gene 
cluster (98,126). 

Expression-based approach. In this approach, libraries consisting of large DNA in- 
serts in a surrogate host are screened for novel activities. This approach has been very 
successful for the discovery of novel enzymes from small-insert environmental librar- 
ies (24,26,113,117). The screening of large-insert environmental soil libraries 
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expressed in E. coli or other expression hosts such as Pseudomonas or Streptomyces 
for antimicrobial activities or for colony pigmentation, led to the isolation of known 
molecules, such as turbomycins (97), violacein (94), indigo blue and indirubin (71), 
and many other known and novel molecules (Diversa Corporation, unpublished data). 
Therefore, the groundwork described for the discovery of novel small molecules de- 
rived from environmental genomic libraries encourages further exploration. 

7. Outlook 

It has been demonstrated that natural -product pathways can be captured from different 
genomes on large-insert vectors and expressed in heterologous systems. The simplified 
construction of environmental libraries with an average insert size larger than 100 kb is 
still challenging. To cover microbial diversity within recombinant libraries requires 
screening of multiple tens of thousands of clones per environmental library. Hence, novel 
high-throughput screening approaches need to be developed to screen this large number 
of recombinant clones without organic extract preparation. The expression level of recom- 
binant small molecules is host dependent and can be low. Therefore, expression hosts 
have to be developed with elevated levels of heterologous expressed molecules (127). 
Different types of hosts should be explored to maximize gene expression levels. The 
knowledge gained from the growing number of bacterial genome sequences will help to 
engineer surrogate hosts for optimum expression (93,128). 

New technologies in microbiology and molecular biology described in this chapter 
will continue to advance the transformation of natural -product discovery, with the focus 
on new culture-dependent and -independent methods to access microbial diversity. 
Future investigations will demonstrate whether microorganisms belonging to phyla that 
have no cultivated representatives so far, produce new bioactive molecules. However, 
in addition to accessing novel biodiversity, faster chemical dereplication methods as 
well as methods to address production and chemical modification of novel chemical 
entities are desired. 
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for Novel Natural Products 
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Summary 

Recent findings suggest that only 1% or less of the total number of soil microbial species can 
be easily cultivated. The fact that uncultured species represent spectacular microbial diversity has 
sparked great interest in these microorganisms as a potentially prolific source of untapped genetic 
diversity encoding novel natural products. Multiple approaches are being developed to access this 
diversity, such as methods to improve the ability to cultivate some of these organisms, most of which 
are not easily grown under standard laboratory conditions. Here we discuss an alternative approach, 
aimed at developing technologies for gaining access to the genomes of uncultivated microbes by 
creating environmental DNA libraries. This method involves isolating large DNA fragments (100- 
300 kb) from soil microorganisms (or from microorganisms derived from other environments), and 
inserting these fragments into a bacterial vector, thus generating recombinant DNA libraries. Such 
libraries are then used to identify novel natural products by various means, including expression of 
the DNA in a heterologous host strain and screening for activities, or by directly analyzing the DNA 
for genes of interest. The recombinant approach thus obviates the need for culturing diverse micro- 
organisms and provides a relatively unbiased sampling of the vast untapped genetic diversity present 
in various microenvironments. As an additional advantage, the genes encoding a product of interest 
are already isolated and can be analyzed using the tools of bioinformatics, thus providing a potential 
boost to the efforts of analytical chemists to identify the product. Furthermore, the possibility of 
regulating the expression of isolated environmental gene clusters or combining them with genes for 
other pathways to obtain new compounds could furnish a further advantage over traditional natural- 
product discovery methodologies. 

Over the last few years, several laboratories have focused their research efforts on a proof of 
concept for the feasibility of this recombinant approach. Areas that have been addressed include (1) 
procedures to generate environmental libraries, (2) assessing the degree of phylogenetic diversity 
encoded in environmental libraries, (3) assessing the feasibility of expressing heterologous DNA in 
multiple expression host strains, and (4) high-throughput methods to screen environmental libraries 
successfully in a reasonable time frame. Here we review the current status of our work in developing 
environmental DNA cloning technologies with respect to these major areas, including compounds we 
and others have found by screening environmental libraries. We also discuss our vision of how to 
transform this exciting new technology from interesting ideas and exciting laboratory results into a 
realistic, productive, high-throughput industrial method for discovering novel natural products from 
environmental microbes. 

Key Words: Natural products; uncultivated microorganisms; diversity; biodiversity; high- 
throughput screening; environmental DNA libraries; heterologous expression; prescreens; reporter 
assays; soil microbes. 
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1. Introduction 

Historically, natural products have played a major role as chemotherapeutic agents. 
The majority of greater than 5000 known anti-infective compounds are derived from 
natural products, with over 100 of these used clinically (1,2). In addition to anti- 
infectives, natural products are strongly represented in the areas of cancer chemothera- 
peutics (3) and immunomodulation (4,5). Methodologies for discovering useful natural 
products in the 20th century evolved from the accidental discovery on a Petri dish of 
the antibacterial effect of penicillin, to more sophisticated discovery technologies, ini- 
tiated by Selman Waksman in 1939. Waksman developed a rational antibiotic screen- 
ing program that resulted in the discovery of streptomycin, thus leading to effective 
treatments for a number of bacterial illnesses. His drug discovery approach provided a 
foundation for the increasingly sophisticated technologies to follow, which included 
the isolation of thousands of microorganisms from soil and other environments, derived 
from areas ranging from New Jersey to exotic Pacific islands, and the development of 
sophisticated high-throughput screening methods to capture whatever novel and useful 
activities were produced by these microorganisms. Although the discovery of new 
structures was initially quite prolific, the discovery rate of novel classes of antimicro- 
bial molecules from natural products has declined in the latter part of the 20th century 
through the present time (6,7), leading industrial research laboratories to reduce or 
abandon their natural-product efforts on the assumption that little remained to be 
discovered. Yet even in the past few years, promising leads for antibacterial cell-wall 
inhibitors derived from natural products have been discovered, and the theory has been 
put forward that compounds that can enter cells and avoid efflux (a problem for many 
compounds derived from rational drug design strategies) have been better designed by 
natural selection (8). Additionally, the "limits for life" continue to be extended, with the 
microorganism that grows at temperatures ranging from 85°C to 121°C as a case in 
point (9). These and other intriguing findings discussed below strongly suggest that 
vast numbers of microorganisms, and hence natural products, remain to be discovered, 
and that there is a strong argument to be made for continuing and extending natural- 
products discovery programs. Recent research using 16S rRNA gene analysis has un- 
covered the exciting finding that the vast majority of bacteria in environmental samples 
are still unknown (10-14). Such analyses (15,16) have shown that these previously 
unknown bacteria belong either to known families or to apparently novel groups. Many 
of these microorganisms are "unculturable" under standard laboratory conditions, 
accounting for their prior lack of detection (1 7). Because the number of microbial spe- 
cies currently cultivatable from soil is thought to represent only 1% or less of the total 
population, these uncultured species have the potential to provide a large pool of novel 
natural products (18-20), if methodologies for tapping into this pool could be found. 
Multiple approaches are being developed to access this diversity for discovery-related 
efforts. One approach is to improve the ability to cultivate some of these previously 
uncultivated organisms, many of which are difficult to grow. Another approach that 
we and others have taken is to create recombinant environmental DNA libraries from 
the genomes of uncultivated microbes, thus providing access to their encoded prod- 
ucts. This approach allows us to circumvent the difficulty in culturing and screening 
most soil microorganisms. The recombinant approach involves extracting microbial 
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Fig. 1. Extraction and cloning of environmental DNA. Large fragments of DNA (>50 kb) 
obtained from uncultured soil microrganisms are ligated into a BAC vector, then transferred 
into various expression hosts. 



DNA from the soil matrix or other microenvironment, and inserting large fragments of 
this DNA into vectors propagated in bacterial strains, such as Escherichia coli, that are 
easy to grow and manipulate genetically. DNA libraries are then screened in various 
ways to detect novel activities, including direct expression of the heterologous genes in 
E. coli and a variety of other, perhaps more suitable, expression hosts, and also by 
screening the recombinant DNA for homologies to known genes of interest. The pro- 
cess for generating libraries is depicted schematically in Fig. 1. Theoretically, this ap- 
proach provides a nonbiased mechanism for accessing biodiversity for drug discovery. 
In addition, it offers some distinct advantages over traditional natural-products screen- 
ing methodologies, which sometimes run into difficulties when trying to reproduce an 
activity of interest from a particular organism. Although the recombinant approach 
also requires analytical chemistry to isolate and identify compounds of interest, the 
DNA encoding the activity is pre-isolated on a recombinant plasmid, and can be iden- 
tified by genetic and other means. The relevant genes can then be sequenced and ana- 
lyzed using bioinformatics tools, thus providing both valuable clues as to the identity 
of the encoded product, and the potential to manipulate levels of production geneti- 
cally. The success of this approach, which we refer to as "molecular biodiversity" tech- 
nology, depends upon many factors, including the ability to clone sufficiently large 
contiguous fragments of environmental DNA into appropriate vectors, and the ability 
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to express or otherwise screen heterologous DNA effectively. Thus, appropriate tech- 
nologies are required to convert this novel approach into an efficient natural-products 
drug discovery tool. Therefore, we have concerned ourselves with developing such 
technologies, including a major emphasis on proving the concept that such an innova- 
tive approach could be successful in discovering novel and useful natural products. 

2. Generation of Environmental DNA Libraries Encompassing Microbial 
Diversity 

2.1. Environmental DNA 

The ability to obtain large DNA fragments is important because many biosynthetic 
gene clusters encoding natural products range in size from approx 30 to 100 kb 
(22-24). Because of their inherent fragility, the isolation of large fragments of environ- 
mental DNA from complex environments such as soil is a difficult process, especially 
because the DNA needs to be sufficiently clean for the enzymatic reactions required 
for cloning. Our group has used two basic strategies — direct and indirect DNA extrac- 
tion — to generate environmental DNA libraries. For the direct approach, DNA is 
extracted from the soil matrix and cloned into a suitable vector, as depicted schemati- 
cally in Fig. 1. Humic acids have been found to be a major contaminant of soil samples 
and can inhibit molecular reactions and reduce transformation efficiency (25). Simple 
electrophoresis and phenol extraction are insufficient for their removal. Therefore we 
developed a process of extracting DNA from soil using gentle hot phenol extraction 
followed by treatment with CTAB (26). The DNA is further sized and purified using a 
discontinuous sucrose density gradient (Fig. 1). According to the indirect approach, 
soil microbes are first isolated by Nycodenz gradient centrifugation, which separates 
microbial cells from the soil matrix, followed by DNA extraction (26). Studies have 
shown that DNA obtained by the indirect method shows no major phylogenetic bias as 
compared with the direct DNA extraction method (27), and that Nycodenz gradients 
recover only bacteria, with no detectable DNA contamination from other organisms 
(27,28). Purified and sized DNA derived by either the direct or indirect methods is then 
cloned into a BAC or cosmid vector following digestion with a restriction enzyme. 
Alternatively, to avoid digesting environmental DNA before cloning (and possibly 
reducing the insert size and/or introducing bias based on G-C content), terminal trans- 
ferase can be used to add polynucleotide tails to the 3' ends of the insert and vector 
DNA (26). 

2.2. Vectors 

We have used both cosmid and BAC vectors as the backbone of environmental 
libraries, and both have been developed with an awareness of the need to express 
environmental libraries optimally in multiple expression hosts (at minimum, E. coli 
and Streptomyces lividans). BAC vectors have the advantage of replicating in single 
copy in E. coli (potentially useful for expression of a product that may be toxic at 
higher gene dosage), and can also stably maintain heterologous DNA inserts of up to 
300 kb (13,29,30). We constructed a new series of BAC vectors, represented by vector 
pMBD14 (Fig. 2), that encompass the elements required for conjugation of DNA into 
Streptomyces (oriT) and, to enhance the stability of the DNA in this host, elements 
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Fig. 2. Three-way BAC cloning vector pMBD14. 



required for subsequent DNA integration into the S. lividans chromosome (i.e., the 
(|)C31 integrase and attachment site, and an apramycin-resistance marker), all flanked 
by loxP sites (31). The utility of conjugation as opposed to transformation into S. 
lividans is discussed later. The BarnHI site in pMBD14 has been used to construct a 
13,000-clone soil DNA library, with insert sizes ranging from 11.5 to at least 85 kb 
(31). On the other hand, cosmid vectors are generally easier to work with, and are very 
suitable when smaller DNA inserts (40-50 kb) are appropriate. Although environmen- 
tal DNA still needs to be carefully purified for cloning into cosmid vectors, cosmids 
are constrained to accept only 40-50 kb DNA, so that DNA sizing steps are unneces- 
sary. Additionally, cosmids are multicopy vectors, which may be advantageous in cir- 
cumstances where an increased gene dosage is desirable, e.g., in the case of a poorly 
expressed but nontoxic product. Cosmid shuttle vector pOS700I was constructed for 
generating environmental libraries. This vector is integrative in S. lividans via the attP 
site and int gene from the Streptomyces integrative element pSAM2 (32), permitting 
site-specific integration of pOS700I into the chromosome of many Streptomyces spe- 
cies. It also encodes both an ampicillin-resistance gene for selection in E. coli, and a 
cassette (33) that confers hygromycin resistance in both S. lividans and E. coli (26). 

2.3. Phylogenetic Diversity of Environmental Libraries 

To evaluate the microbial diversity represented by the DNA used to construct our 
soil libraries, we carried out phylogenetic analyses of DNA sequences encoding small 
subunit ribosomal RNA (SSU rRNA) (10). 

Typical results of these studies (Fig. 3) are consistent with a growing volume of 
work that documents the diversity found in DNA extracted from various soils 
(25,27,34,35). Such studies consistently uncover a wide representation of sequences 
that could be placed within known families isolated from all over the world, but a large 
number are from previously unidentified bacterial families. In another study, 47 rRNA 
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Fig. 3. Phylogenetic analysis of soil DNA. Purified DNA obtained from the soil was used as a template for polymerase chain reaction amplification. 
For the construction of the phylogenetic tree, alignment of the first 4 17 nucleotides (5') from 111 independent clones and known 16S rRNA sequences 
(obtained from Ribosomal Database Project [RDP] at the University of Illinois) was accomplished using ClustalX (freeware). Phylogenetic analysis 
was performed using MacClade (48). The tree was inferred by neighbor-joining analysis of 437 homologous positions. 
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gene sequences were amplified directly from environmental DNA library clones (26). 
Analysis of these sequences showed that the majority were from Proteobacteria, that 
all 47 sequences were unique, and that they were derived from phylogenetically diverse 
microorganisms, most of which have never been screened or isolated. The significant 
finding here was that the diversity seen previously in soil DNA analyses was carried 
over to large pieces of DNA actually extracted from soil, and, importantly, that this 
diversity was then effectively captured for screening in an environmental DNA library. 

3. New Streptomyces and Pseudomonas Expression Host Strains 

Because different microbial strains and species have differing expression capabili- 
ties, the ability to express environmental libraries in a variety of host strains in order to 
improve the chances of detecting novel natural products in screening assays is very 
advantageous. E. coli was the obvious choice as a primary host, as many genetic tools 
are available and it is easy to manipulate. However, the fact cannot be ignored that 
many known (and useful) natural products have been derived from naturally occurring 
polyketides. Actinomycetes are a major source of useful natural products, including 
polyketides and nonribosomal peptides. E. coli does not express functional polyketide 
synthases and nonribosomal protein synthases efficiently, at least partially because it 
lacks the phosphopantetheinyl transferases required for their activation. For these rea- 
sons, and because the expression of heterologous gene clusters and polyketide genes is 
well established in Streptomyces (36), we chose to develop S. lividans as an additional 
host strain for environmental library expression. Optimally, host expression strains 
should lack endogenous activities, including pigments, as their synthesis may poten- 
tially waste available metabolites and therefore interfere with the expression of heter- 
ologous compounds. Additionally, endogenous activities could obscure the detection 
of novel compounds that are produced from heterologous DNA. The act and red gene 
clusters in S. lividans encode the pigments actinorhodin and various undecyl- 
prodiginines, respectively. We deleted these clusters from the S. lividans chromosome 
by positive selection of unmarked allelic exchange mutants (31). The resulting Aact 
cluster deletion encompassed 24.2 Kb (nucleotides 143959 to 168217, GeneBank 
accession number SC0939122), and the Ared cluster deletion encompassed 28.6 Kb 
(nucleotides 144623 to 173286, GeneBank accession number SC0939125). The dele- 
tions were verified by polymerase chain reaction (PCR), and the new S. lividans host 
strain, AactAred, no longer produced actinorhodin or undecylprodiginine. S. lividans 
Aact Ared was shown to grow and sporulate as well as the TK24 parent strain, and thus 
provides the desired background for heterologous natural-product expression and 
analysis (31). An additional improvement to the S. lividans screening system encom- 
passes an environmental DNA sequence found to confer early and enhanced expres- 
sion of secondary metabolites in S. lividans (A. Martinez, manuscript in preparation), 
thus improving our ability to detect heterologous expression in this host. We developed 
an additional host to further extend the host range for expression of environmental 
libraries, the soil organism Pseudomonas putida KT-2440 (37). Many genetic tools are 
available for pseudomonads, including plasmid DNA transfer by conjugation. 
Pseudomonads are known to colonize soil, water, and other diverse environments. They 
encompass rich metabolic diversity, including the ability to degrade xenobiotics and to 
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Fig. 4. Expression host Pseudomonas putida MBD1. The P. putida expression host contains 
the Streptomyces phage (f»C3 1 attB site, enabling integration of BAC plasmids encoding the 
4>C31 attP site and int gene. 



produce secondary metabolites, including polyketides (37,38). Our E. coli-Streptomy- 
ces pMBD14-based shuttle BAC vectors and libraries can be transferred by conjuga- 
tion into this well-characterized nonpathogenic P. putida strain, and we have 
engineered the strain to integrate the incoming vectors into the chromosome by making 
use of the integration properties of the d>C3 1 {Streptomyces phage) system. The attB 
site for (|)C3 1 integration was inserted into the chromosome of P. putida by means of 
the site-specific integration system of P. aeruginosa phage (])CTX (31), thus giving rise 
to expression strain P. putida MBD1 (Fig. 4). Integration of incoming pMBD14 library 
plasmids into the chromosome of P. putida MBD1 is predicated on the ability of the 
(])C3 1 integrase gene, present on the pMBD14 vector, to be expressed in the P. putida 
host strain. This was found to be the case, as discussed later. Note that crude methanol 
extracts of P. putida MBD1 did not produce detectable endogenous antibacterial or 
antifungal activities. Thus our two new host strains, S. lividans AactAred and P. putida 
MBD1, provide an apparently "clean" background for production and detection of het- 
erologous compounds. 

4. Transfer and Expression of Recombinant pMBD14 Plasmids 

As described previously, we developed our shuttle vector and expression host sys- 
tems such that a recombinant vector library encoding environmental DNA inserts could 
be transferred from E. coli to S. lividans and P. putida via conjugation. Especially for 

5. lividans, alternative genetic transfer systems, such as protoplast transformation, are 
laborious and somewhat inconsistent, and the ultimate success of a molecular 
biodiversity screening program depends upon the ability to screen in various expres- 
sion host strains in a high-throughput fashion. Conjugation can be carried out by spot- 
ting cultures on agar Petri plates, and is therefore amenable to a high-throughput robotic 
procedure. We found that E. coli donor strains efficiently transferred recombinant 
pMBD14 plasmids containing inserts of up to 85 kb (the largest size tested) into 5. 
lividans, significantly larger than any reported previously, and recombinant plasmids 
successfully integrated into the S. lividans chromosome (31). Despite the finding that 
some of the larger integrated inserts contained small deletions at one integration junc- 
tion, most of the environmental insert DNA was present in the chromosome of the 
exconjugants, and provided long contiguous stretches of environmental DNA avail- 
able for expression in the new host. Similar analyses showed that these pMBD14 re- 
combinant plasmids can be transferred into P. putida MBD1 and maintained stably in 
the chromosome; conjugation resulted in stable apramycin-resistant P. putida MBD1 
colonies, confirming that the (])C31 integrase gene and the apramycin-resistance gene 
encoded on pMBD14 were expressed and functional in P. putida (31). After confirm- 
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Table 1 

Expression 3 of Heterologous Compounds by Bacterial Expression Hosts 

Escherichia coli Streptomyces lividans Pseudomonas putida 

DH10B AactAred MBD1 

MG1.1 + - - 

Granaticin - + - 

DAPG - - + 

"Expression of compounds was measured in cell extracts by high-performance liquid chroma- 
tography analysis and by antibacterial and antifungal assays, as described in the text. 



ing the feasibility of conjugating recombinant vectors containing large environmental 
DNA inserts from E. coli to both S. lividans and P. putida and ascertaining that the 
recombinant vectors were integrated into the respective chromosomes, we developed a 
plate conjugation method, using a 96-pin array, thus enabling the high-throughput trans- 
fer of recombinant plasmids into the new expression hosts (31). This method for trans- 
ferring pMBD14-based large-insert libraries from E. coli to S. lividans and P. putida 
resulted in a 95% success rate for conjugation and is the first example of BAC vectors 
that can be shuttled by conjugation from E. coli to both Streptomyces and Pseudomo- 
nas. Expression of several heterologous gene clusters present on the pMBD14 vector 
was tested in the various hosts using a series of constructs (pSGran, pSMGl.l, and 
pSDAPG [31]) encoding the synthesis of known antibiotics (granaticin, indirubin, and 
the antifungal compound 2,4-diacetylphloroglucinol [DAPG]). Cells containing these 
plasmids were grown, extracted with ethyl acetate, and the extracts analyzed. Results 
showed that each of the hosts expressed a different cluster, but was unable to express 
the other two (Table 1), as measured by antibacterial or antifungal assays, or by high- 
performance liquid chromatography (HPLC) analysis of extracts. These results call 
attention to the necessity of screening in multiple expression hosts, and underscore the 
critical importance of the three-way conjugative shuttle BAC vector. Further, our find- 
ings point out the advisability of screening in as many hosts as feasible, thereby 
increasing the chances of detecting the expression of molecules of interest. 

5. Screening Recombinant Libraries 

Recombinant environmental DNA libraries can be screened for encoded activities by 
a variety of means that encompass two fundamental strategies: expression of encoded 
activities by host strains, and detection of DNA homologies to known genes of interest. 
Expressed activities can be assayed in live cells (colony screens) or in cell extracts. 
Cell extracts can in turn be assayed for activities in targeted assays or by LC/mass 
spectrometry (MS) analysis, which allows for the rapid dereplication of known natural 
products and can detect the presence of new molecules not found in the host back- 
ground. When "hits" (i.e., clones that are positive in an initial screen) are identified, 
they are retested at least once. The corresponding BAC plasmid is then isolated and 
introduced into a fresh host cell and the phenotype rechecked, in order to verify that the 
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environmental DNA insert in the BAC plasmid is the source of the phenotype. Once 
confirmed, transposon-mediated insertion mutagenesis is used to disrupt the encoding 
environmental genes. Cells containing transposon insertions are screened for loss of 
the acquired phenotype (25). The encoding environmental DNA can then be sequenced 
from the transposon ends in order to determine the sequence of the gene(s) of interest. 
The resulting DNA sequence can then be analyzed using bioinformatics tools, which 
can help identify function, novelty, and phylogeny of the genes and expressed prod- 
ucts. Our initial screening efforts (and those reported by others) have concentrated 
principally on screening E. coli libraries. This is because our major focus was first the 
proof of concept for molecular biodiversity technology, followed by the development 
of technological improvements aimed at incorporating these first principles into a real- 
istic and high-throughput drug discovery program. However, despite the fact that E. 
coli is not an ideal expression host for reasons already discussed, a number of interest- 
ing genes and activities have been discovered in E. coli environmental libraries, lend- 
ing much encouragement to the enormous potential for this emerging technology. 

5.1. E. coli Colony Screens 

We, as well as other workers in the field, have screened E. coli colonies containing 
environmental DNA. Colonies were grown on agar Petri dishes and screened for ac- 
tivities that could be assessed easily using plate assays, including antibacterial (against 
Bacillus subtilis or Staphylococcus aureus) and antifungal (against Candida albicans) 
activities. Early libraries were also assayed for enzyme activities in agar plate screens, 
in order to provide data addressing the frequency of appearance and expression of het- 
erologous genes (25,26,34). These screens can normally be carried out robotically, 
using large (9" x 9") agar trays onto which colonies are transferred using a 96-pin 
array. Individual screens make use of assay-specific agar, which is often commercially 
available in the case of enzyme assays. For antibacterial screening, colonies are nor- 
mally grown for several days, usually at 37°C for 1 d and at room temperature for some 
additional days, and overlaid with soft agar containing an indicator strain (e.g., B. 
subtilis or other Gram-positive bacterium). After further incubation, plates are scored 
for activity by looking for a zone of inhibition in the B. subtilis lawn surrounding an 
environmental library colony. Antifungal screening follows a similar process (31). The 
choice of incubation temperatures for environmental libraries is an interesting issue, 
since E. coli's optimum temperature of 37 °C is not necessarily (or likely to be) optimal 
for expression of a particular heterologous gene derived from a soil environment. There- 
fore, it is beneficial to screen library colonies after growing for various amounts of 
time and at several temperatures. 

5.2. HPLC Screens 

An additional method of screening for expression of heterologous DNA in environ- 
mental libraries involves HPLC or LC/MS analysis of cultured library clones, aimed at 
detecting and characterizing new metabolites that are produced by the cells and appear 
in cell extracts. Peaks that do not match those found in the host strain bearing the vector 
alone potentially reveal the presence of homologous compounds not present in the 
host. Although cumbersome, the method can be carried out in a fairly high-throughput 
fashion, and has succeeded in detecting novel compounds in environmental libraries 
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Table 2 

Some Molecular Structures Isolated From E. Co// (Soil) Environmental Libraries 



Library Number Average DNA Type of 

Type of insert origin screen 4 
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Environmental library screens are described in the text. 

b 8 amylase, 2 lipase, and 1 DNase activities were detected, all distinct and novel 

(33). information not available. 



(Table 2) (26,39). In one such library (26), five novel structures were detected in a 
library of 5000 clones, each clone containing about 40 kb of insert DNA. However, 
none of these novel structures had a detectable activity, and therefore an estimate of the 
number of colonies needed to be screened by this method in order to obtain an interest- 
ing molecule cannot yet be derived. 

5.3. Environmental DNA Homology Screens (Molecular Screens) 

Environmental DNA encoded in library clones can also be screened for homologies 
to known genes of functional relevance. The advantage of this type of screen is that 
detectable expression of the encoding DNA is not required in a particular host strain. A 
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disadvantage, on the other hand, is that the screen is restricted to known classes of 
molecules. One method of screening for DNA homologies involves amplifying spe- 
cific biosynthetic gene sequences of interest from recombinant vectors. For example, 
polyketides, a vast group of structurally diverse natural compounds produced by a large 
variety of soil microorganisms, are a major class of natural products. The existence of 
highly conserved regions of actinomycetes type I polyketide synthases (PKS 1) DNA, 
flanking the active site of the keto acyl synthetase domain, provided two sets of primer 
sequences that could be used for the PCR amplification of homologous genes, and thus 
furnished a means for gauging the frequency of appearance of genes encoding 
polyketide biosynthetic enzymes in environmental libraries (26). 

5.4. Results of Screening Environmental Libraries 

Much of the work describing the potential for drug discovery from environmental 
DNA libraries by these and similar means has not yet been published; therefore the list 
of molecules that have been discovered via molecular biodiversity screening is very 
incomplete. However, a partial list of molecules, derived from the work of several 
different laboratories, is shown in Table 2. Again, when considering these results it is 
important to keep in mind that the majority of published screening results describe 
work only with E. coli as the expression host. Extensive multi-host large-scale screen- 
ing of environmental libraries using our three-way shuttle B AC technologies is a prom- 
ising way forward but has only just begun. Nevertheless, available screening results 
provide very useful data and firmly prove the concept that novel and interesting mol- 
ecules can be found in environmental libraries by the means described here, even when 
such libraries are expressed in a less-than-ideal expression host. Table 2 also shows 
that activities derived from E. coli libraries frequently consist of families of small mol- 
ecules, rather than single discrete compounds. These compounds or families are encoded 
by single genes, or gene clusters ranging from 3 to 13 genes (25,41). 

Although some screening has been carried out in streptomycetes, this has so far been 
on a small scale, as the cumbersome transformation procedure previously required to 
transfer plasmids into streptomycetes has precluded a genuinely high-throughput effort. 
The three-way shuttle BAC vector pMBD14, which permits high-throughput conjuga- 
tion from E. coli to S. lividans, will greatly enhance the ability to screen libraries in the 
S. lividans host. Very encouragingly, DNA homology screens to determine the preva- 
lence of PKS I genes in our cosmid environmental library revealed 1 1 distinct type I 
PKS gene sequences, a number much higher than expected in a random and relatively 
small (<250 Mb) DNA library sample (26). Additionally, the identification of a partial 
PKS 1 pathway with nearly six genes in one clone of this library strongly suggests that 
complete clusters of polyketide or other biosynthetic genes would be expected to 
appear in a library containing more clones and larger inserts. Our screening focus has 
been primarily on the detection of heterologous small molecules, consistent with our 
goal of augmenting chemical libraries. We have also screened for enzyme activities as a 
means of gauging the frequency of appearance and expression of heterologous sequences 
in our environmental libraries. Note that because enzymes are normally encoded by a 
single gene rather than by large multigene pathways, their discovery requires much 
smaller environmental DNA inserts. 
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6. Technologies for Generating Quality Environmental Libraries 
for High-Throughput Screening 

The data in Table 2 give us a rough idea of the frequency with which heterologous 
molecules might be detected in E. coll environmental libraries, and lend credence to 
the concept that this type of technology is feasible for drug discovery. Note that one 
E. coll BAC library generated a hit rate for antibacterial activities of roughly one 
antibacterial clone per 60 MB of soil-derived DNA (25), although this number is 
expected to vary depending upon insert size and many other factors. Natural-product 
screens directed against a variety of targets usually require the preparation and screen- 
ing of cell extracts, however. For libraries with hit rates similar to those described 
above, most of the extracts derived from library clones would be unlikely to contain 
novel molecules, let alone molecules active against specific targets of interest. For this 
reason, more recent efforts of our group have been directed toward generating quality 
libraries, in which every (or nearly every) clone is known to contain a heterologous 
product that can be screened productively in an assay. To this end, we have replaced 
the concept of screening static environmental libraries with a more dynamic approach 
that involves the continuous generation and prescreening of environmental DNA clones 
to select out those that are producing heterologous products. The goal is to generate 
an efficient high-throughput method for identifying and separating out those cells in 
environmental libraries that encode heterologous bioactive natural products. 
Prescreens thus serve as a filter to isolate producing clones. We have begun to develop 
reporter strains of E. coli for use in prescreening. The method (Fig. 5) utilizes a battery of 
E. coli strains that have been engineered to encode, in single chromosomal copy, 
chemoresponsive promoters that respond to particular classes of metabolites, fused to a 
reporter gene (encoding green fluorescence protein, GFP). Bacterial stress-response 
genes and multidrug efflux pumps are known to be activated in response to certain 
chemicals, and therefore a cell producing a compound expressed from heterologous 
DNA might be expected to be induced for a stress and/or efflux response, resulting in 
GFP production (42,43). For example, certain classes of antibiotics are known to pro- 
duce characteristic stress responses (44). The GFP prescreen can be carried out with a 
multitude of different stress and efflux pump fusion strains, many of which we have 
already constructed (Ian MacNeil, unpublished results). An additional attractive fea- 
ture is the very high throughput nature of this prescreen: cells producing GFP can be 
sorted by flow cytometry and then isolated (45), as diagrammed in Fig. 5. Clearly this 
potentially powerful process enables rapid sorting of producing clones from a back- 
ground of millions of nonproducing clones. Simple prescreens for S. lividans have also 
begun to be developed (e.g., overproduction of actinorhodin in response to the produc- 
tion of a heterologous molecule (26)), and should further facilitate the ability to gener- 
ate quality libraries from a continuous supply of millions of environmental DNA clones. 
Other prescreens for all hosts comprise high-throughput (robotic) antibacterial and an- 
tifungal colony or extract screens, as already described. The availability of multiple 
means for prescreening millions of clones is likely to be a critical step in deriving value 
from the shotgun approach to natural-product discovery from environmental microbes. 

In addition to developing effective prescreens for generating quality environmental 
libraries, it is clearly advantageous to incorporate into the process as many expression 
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Fig. 5. Escherichia coli prescreening strains. A battery of prescreening strains each contains 
a stress or efflux promoter fused to a green fluorescence protein reporter gene, integrated into 
the E. coli chromosome. 



hosts with differing expression capabilities as possible. For example, Myxcoccus 
xanthus is an excellent candidate. Myxobacteria are widely distributed in soils (46) and 
are know to synthesize many natural products. More than 80 basic structures have been 
isolated from myxobacteria, including many that are novel (47). The genome of M. 
xanthus is nearly twice the size of that of E. coli. As a large percentage of its genome 
appears to be devoted to secondary metabolite biosynthesis, it is reasonable to expect 
that this host would have many of the tools required to express environmental DNA 
and possibly modify the expressed product in a variety of ways. Furthermore, genetic 
tools are available for M. xanthus, including transduction and conjugation (48), sug- 
gesting that its genome could also be engineered to accept the three-way shuttle BAC 
plasmid. In addition to bacteria, lower eukaryotic hosts, such as fungi, for which genetic 
tools are also available, are attractive possibilities for expression hosts. 

With the technologies already developed, we have set up a process for generating 
and prescreening environmental libraries, diagrammed in Fig. 6. According to this pro- 
cess, thousands of environmental library clones are generated weekly and transformed 
into E. coli. The clones are then prescreened in a variety of ways, including reporter 
stress/efflux pump and antibacterial/antifungal screens. Positive clones, once vali- 
dated, are added to the high-quality library, and are also conjugated into S. lividans 
and P. putida. In addition, random E. coli clones that are not positive in the E. coli 
prescreen would also be conjugated into the other host strains for prescreening. Con- 
firmed positives from at least one of the prescreens in one of the hosts would comprise 
the high-quality environmental library, and organic extracts of these clones would be 
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Fig. 6. High-throughput screening process for expression of environmental libraries. The 
goal of this process is to screen millions of environmental clones quickly and efficiently in 
order to identify those clones producing heterologous molecules. Producing clones are then 
selected to be included in a "quality" environmental library, which is then screened as cell 
extracts against targets of interest. 



prepared for high-throughput screening against targets of interest. This screening 
scheme is just one of many possibilities for generating high-quality environmental 
extract libraries. Note that the fluid nature of molecular biodiversity technology per- 
mits its use in nearly any type of screening program, since prescreens can be specifi- 
cally tailored toward targets of interest. 

7. Conclusions 

Screening as yet uncultured microorganisms for novel and useful natural products 
by means of shotgun cloning environmental DNA is an exciting new technology that 
has the potential to become a powerful tool for drug discovery. We and others have 
confirmed that genes encoding natural products can be readily captured using this strat- 
egy, and that heterologous host strains can be optimized to express environmental librar- 
ies. In addition, conjugative transfer of BAC plasmids containing environmental DNA 
makes feasible the transfer and screening of entire large-insert DNA libraries in mul- 
tiple host expression strains. It is our view that in order to maximize the chances of 
discovery, environmental libraries need to be generated and screened in a high-through- 
put fashion, using as many different expression hosts as practical, thus greatly increas- 
ing the chances for success. At this point the technology has succeeded in the proof of 
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concept stage, and is poised to move forward rapidly. The next important step to a 
productive drug-discovery program, in which this potentially vast source of natural 
products derived from uncultivated environmental microorganisms can be fully exploited, 
is critically dependent upon a firm commitment and adequate funding. 
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Summary 

There is currently renewed interest in the study of microorganisms as a source of structurally 
unique and pharmacologically active natural products. Actinomycetes, being the single most produc- 
tive source of naturally occurring antibiotics, are a logical component of these studies, and success 
with this group will be enhanced by the inclusion of previously unknown taxa. Recent studies of 
marine-derived actinomycetes have revealed the widespread distribution of unique marine taxa resid- 
ing in ocean sediments. Chemical studies of these strains, focusing on members of the new genus 
Salinospora, have led to a high rate of novel secondary metabolite discovery, including molecules 
with potent biological activity. Given the encouraging results from preliminary studies of these newly 
described marine bacteria, it seems clear that marine actinomycetes represent an important future 
resource for small-molecule drug discovery. 

Key Words: Marine actinomycetes; natural products; secondary metabolites; phylogenetic 
diversity. 

1. Introduction 

Members of the order Actinomycetales include slow-growing, filamentous bacteria 
that are abundant in soil, where they play important roles in the breakdown of recalci- 
trant organic materials (1). Historically, these Gram-positive bacteria are best known 
for their unparalleled ability to produce structurally diverse secondary metabolites, a 
capacity that made actinomycetes a central component in the development of the mod- 
ern pharmaceutical industry. Their role as a source of useful pharmaceuticals is high- 
lighted by the remarkable fact that, as of 1988, actinomycetes accounted for 
approximately two-thirds of the naturally derived antibiotics discovered (2), clearly 
making them the single most important source of naturally occurring medicines. 

Although the positive impact of actinomycete secondary metabolites on human 
health is clear, there is a perception that 50 yr of intensive research by the pharmaceu- 
tical industry has exhausted the supply of compounds that can be discovered from this 
group of bacteria. This perception has been a driving force behind the recent shift away 
from natural products as a resource for the discovery of small-molecule therapeutics 
towards other drug-discovery platforms, including high-throughput combinatorial syn- 
thesis and rational drug design. These new technologies, however, have proven to be 
less productive at yielding quality leads de novo (as opposed to, e.g., combinatorial 
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synthesis as a method for modifying a pre-existing chemical scaffold), and as a result 
there has been a recent groundswell of renewed interest, largely from the biotechnol- 
ogy sector, in returning to nature as a source of new structural motifs. 

So the question can be asked, is it logical to reinvestigate actinomycetes as a source 
of natural products given the intensity with which these bacteria were studied in the final 
five decades of the twentieth century? In other words, are the returns that can be antici- 
pated from these bacteria so skewed towards the isolation of known chemical scaffolds 
that the effort required to discover the few remaining new molecules cannot be justi- 
fied? That is, has the chemical potential of the actinomycetes been exhausted? In the 
case of Streptomyces, the genus from which the largest number of antibiotics have 
been discovered (3), it was recently estimated that only 3% of the compounds produced 
by this taxon have been characterized (4), thus implying that further studies, even of 
this relatively well-characterized genus, are warranted. Given that improved analytical 
techniques now allow for the rapid dereplication of known compounds and the detec- 
tion and characterization of molecules that occur at very low concentrations, the search 
for new natural products can be carried out with greatly enhanced efficiency. When 
coupled with high-throughput screening, target-specific assays, improved cultivation 
techniques, functional genomics, and our growing understanding of the molecular basis 
of secondary metabolite production, there can be little doubt that many promising actino- 
mycete secondary metabolites await discovery. In fact, we may now be entering a new 
era in which these technologies are creating a "paradigm shift" in our approach to 
natural-product discovery (5), a shift with the potential to result in a rate of pharmaceu- 
tical lead discovery that will eclipse even that which occurred during the "Golden Age" 
of the pharmaceutical industry. 

There are many approaches that can be applied to the discovery of novel actino- 
mycete secondary metabolites, and certainly high among these is the search for new 
actinomycete taxa. More than 20 yr ago, as success rates from streptomycetes 
dropped, efforts to explore the ability of nonstreptomycete actinomycetes, such as 
Micromonospora sp., to produce unique metabolites proved invaluable (6). Today, 
however, Micromonospora is no longer underexplored, relatively speaking, and a dif- 
ferent set of "rare" genera, such as Actinokineospora, are garnering attention for their 
chemical prolificacy (7). Thus, the strategy of seeking out new actinomycete taxa 
remains a valid approach to drug discovery, with questions addressing where these 
taxa occur and what is required for their cultivation becoming key to the success of 
future natural-product discovery programs. 

Being at an oceanographic institution, we have long been interested in the concept 
of the existence of marine actinomycetes and the potential of these bacteria as a source 
of new secondary metabolites (e.g., refs. 8,9). Although a case for the existence of 
indigenous marine actinomycete populations was made more than 30 yr ago (10), the 
occurrence of marine actinomycetes has not been widely embraced either by the marine 
microbiology community or industrial microbiologists, and a prevailing sentiment 
expressed in the literature suggests that the vast majority of marine-derived strains 
are of terrestrial origin (e.g., ref. 11). Given that many soil actinomycetes produce 
resistant spores that are washed in large numbers from shore into the sea, where they 
have the potential to remain viable yet dormant for an undetermined period of time, 
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this skepticism was not without basis. However, evidence in support of the existence of 
marine actinomycetes has been accumulating with the discovery that some strains dis- 
play specific marine adaptations (12) while others appear to be metabolically active in 
marine sediments (13). Further support came from the description of Rhodococcus 
marinonascens, the first marine actinomycete species to be characterized (14). This 
prior evidence in now further corroborated by the recent discovery of a new marine 
actinomycete taxon that is widely distributed in ocean sediments (15). 

This chapter is not an attempt to review the approx 100 novel compounds that have 
been reported from marine-derived actinomycetes over the last 20 yr, as much of this 
information has been summarized elsewhere (e.g., refs. 16,17 and references cited 
therein). We will instead focus on recent advances in our understanding of marine 
actinomycete diversity and the secondary metabolites that have been discovered from 
strains that either represent new marine taxa or are clearly adapted to life in the ocean. 

2. Studies of Marine Actinomycetes 

As part of a program to explore marine actinomycetes as a source of novel second- 
ary metabolites, we have been studying actinomycete populations that occur in ocean 
sediments. These studies have incorporated modern concepts in natural-product isola- 
tion and structure elucidation along with new cultivation methods and the application 
of molecular techniques to assess both culture-dependent and culture-independent 
microbial diversity. Although actinomycetes can readily be isolated from marine 
sediments if the appropriate selective techniques are applied, it is only through the use 
of molecular phylogenetics that we have begun to get a clear picture of the diversity, 
distributions, and taxonomic uniqueness of certain populations. Once recognized as 
unique, these populations can then be selected as the focal point of intensive natural- 
product studies. 

Using the SSU rRNA gene to trace evolutionary relationships among actinomycetes, 
it has become increasingly clear that the oceans harbor phylogenetically unique actino- 
mycete populations and that these bacteria can be readily obtained in culture and stud- 
ied as a source of new natural products. More importantly, from what can only be 
considered preliminary studies, these bacteria have been found to produce a remark- 
able range of structurally unique secondary metabolites. This finding suggests that 
adaptations to life in the sea, coupled with genetic isolation from terrestrial strains, 
has led not only to the evolution of new actinomycete species that are highly adapted to 
life in the ocean but to the production of secondary metabolites that are unlikely to be 
discovered from nonmarine strains. 

2. 1. Salinospora 

Our first glimpse into the phylogenetic diversity of marine actinomycetes came 
following a study of sediment populations cultured from samples collected from the 
Bahamas (15). Many of these bacteria required seawater for growth, and, since this 
physiological requirement had never previously been reported for actinomycetes, we 
suspected that they represented a new marine taxon. However, it wasn't until we 
applied SSU rRNA sequencing methods that it became apparent that these seawater- 
requiring strains were also unique at the genetic level. These findings led us to the 
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conclusion that highly adapted marine taxa occurred in ocean sediments and that these 
bacteria were significantly different from their terrestrial counterparts. Looking back 
at strains isolated from some of the same locations more than 10 yr earlier (12), we 
realized that they also belonged to this new marine phylotype, and that members of this 
group represent a consistent component of the sediment bacterial community. These 
observations, coupled with the recovery of related strains from tropical marine loca- 
tions worldwide, led us to believe that we had cultivated the first marine actinomycete 
taxon that was persistent and widespread in tropical ocean sediments. Based on their 
level of phylogenetic uniqueness, we proposed the generic epithet Salinospora to 
describe this group (Fig. 1). As of this writing, a formal description of the genus 
"Salinospora," including two type species ("S. tropica" and "S. arenicola"), has been 
submitted for publication (18). If accepted, "Salinospora" will be the first marine acti- 
nomycete genus to be formally recognized, marking an important milestone in a slow 
march towards the recognition that actinomycetes are part of the indigenous marine 
microbiota. 

Given our new appreciation of the phylogenetic diversity of marine actinomycetes, 
we began detailed chemical evaluations of members of the Salinospora group. Organic 
extracts of these actinomycetes, whose closest relatives are Micromonospora spp., 
another important group of secondary metabolite producers, yielded remarkable lev- 
els of biological activity. Bioassay-guided fractionation of the extract of one strain 
(CNB-392) quickly led to the discovery of an extraordinarily potent cytotoxin that we 
have called salinosporamide A (19). 

Salinosporamide A (1, Fig. 2) contains an unusual bicyclic y-lactam- (3-lactone ring 
system that forms the core of the molecule. It displayed potent in vitro cytotoxicity in 
assays against the human colon tumor cell line HCT-1 16 with an IC 50 of 35 nM. When 
tested in the National Cancer Institute's (NCI's) 60-cell panel, the mean IC 50 was <10 
nM and it displayed a four-log range in activity among the least and most sensitive cell 
lines in both total growth inhibition (TGI) and LC 50 (concentration required to kill 50% 
of the cells), indicating a high degree of cell line selectivity. Based on its potency and 
selectivity, salinosporamide A (1) was selected for the NCI's hollow fiber assay, 
where the score achieved was not sufficient for the compound to advance to xenograft 
studies, possibly a result of aqueous hydrolysis of the (3-lactone, which we have 
observed in some cases, depending upon handling conditions. 

Salinosporamide A (1) is most closely related to omuralide (2), also known as clasto- 
lactacystin (3-lactone, the hydrolysis product of lactacystin (20) that has been described 
as the only known truly specific proteasome inhibitor ever discovered (21). Although 
salinosporamide A (1) shares an identical bi-cyclic ring structure with omuralide (2), it 
is uniquely functionalized, possessing a methyl at the C-3 ring juncture, a chloroethyl 
instead of a methyl at C-2, and a cyclohexene instead of an isopropyl at the C-5 posi- 
tion. Like omuralide (2), salinosporamide A (1) is also a potent proteasome inhibitor, 
inhibiting the chymotrypsin-like proteolytic activity of the 20S proteasome with an 
IC 50 value of 1.3 nM, which is approx 35 times more potent than omuralide (2, IC 50 = 
49 nM) when tested in the same assay (data provided by Nereus Pharmaceuticals, Inc., 
San Diego). 

Like many bacteria that possess the genetic capacity to produce secondary metabo- 
lites, strain CNB-392 does not produce only one molecule but a series of related com- 
pounds. To date, in addition to salinosporamide A (1), we have isolated one additional 
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Fig. 1. Neighbor-joining phylogenetic representation of cultured marine actinomycetes and their 
closest National Center for Biotechnology Information (NCBI) (BLASTn) relatives based on almost 
complete SSU rRNA gene sequences. Phylogenetically unique marine actinomycetes include the 
new marine genus "Salinospora" along with the proposed genus "Marinophilus" and a potential 
third genus that is currently being described as MAR4. Representatives of all three new phylotypes 
produce secondary metabolites that possess heretofore undescribed carbon skeletons. Our strain 
numbers are in bold and bootstrap values (1000 re-samplings) are presented at the respective nodes. 
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Fig. 2. Actinomycete secondary metabolites. 



natural product and a series of related degradation products (22), and from structure 
activity relationships have been able to gain insight into the features of compound 1 
that are associated with its potent biological activity. 

The first structural feature of salinosporamide A (1) that was found to be associated 
with its HCT-116 cytotoxicity became apparent following the isolation of salino- 
sporamide B (3), the dechloro-analog of 1. Loss of the chlorine atom at C-13 results in 
a greater than 500-fold loss in activity, with a reduction in the IC 50 to 20 \lM. This is not 
unprecedented, as chlorine, being a good leaving group, facilitates biological nucleo- 
philic attack at the adjacent carbon by nucleophiles such as DNA or, in this case, most 
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likely a protein. Chlorine has been shown to be required for activity in a variety of 
microbial metabolites including vancomycin (23), the antibiotic of last choice against 
MRSA, and thus salinosporamide A (1) joins the ranks of chlorinated metabolites in 
which a chlorine substituent is an essential element for biological activity. 

But the chlorine atom at C-13 is not the only active site in the salinosporamide A (1) 
molecule. This became clear with the isolation of the salinosporamide degradation 
products (4), a derivative in which the chloro-ethyl functionality is maintained but the 
(3-lactone is missing. Loss of the (3-lactone ring reduces the activity of compound 4 
5000-fold in comparison to compound 1. Thus it appears that at least two sites in the 
salinosporamide A (1) molecule are associated with its biological activity. Of the remain- 
ing molecules in the series, all but salinosporamide B (3) are chlorinated, while only A 
(1) and B (3) maintain the intact (3-lactone ring. As of this writing, Salinosporamide A (1) 
is proceeding through preclinical development as an anticancer agent at Nereus Phar- 
maceuticals, Inc. 

The exquisite selectivity with which omuralide (2) is capable of inhibiting 
proteasome function, without inhibiting other proteases or lysosomal protein degrada- 
tion (21), has made it a powerful tool in the study of protein biochemistry and cell 
biology. It has also generated interest in the structural features of compound 2 that are 
required for proteasome inhibition. Of specific interest to our studies of salino- 
sporamide A (1) was the finding that the omuralide (2) C-9 isopropyl is optimal for 
activity and that other substituents at this position, including phenyl, resulted in sig- 
nificantly reduced activity (24). In addition, the X-ray crystal structure of the 
omuralide-inactivated 20S proteasome indicated that the isopropyl group fits into a 
lipophilic pocket. Because activity was greatly reduced by the substitution of diverse 
alkyl groups at C-9, it could be implied that there was a fairly snug fit of the isopropyl 
group into the complementary binding pocket (24). Given that salinosporamide A (1) 
possesses the bulkier cyclohexene substituent at C-9, and that in omuralide a phenyl at 
this position resulted in a complete loss of activity, compound 1 may have a unique 
20S proteasome binding site. 

2.2. Additional Compounds From "Salinospora" spp. 

In the course of performing time-dependant studies of salinosporamide A (1) pro- 
duction, we observed the production of two new molecules late in the fermentation 
period. Analysis of these compounds by liquid chromatography (LC)/mass spectrom- 
etry (MS) indicated that both molecules possessed a molecular weight of 536 and a 
single chlorine atom. Subsequent cultivation and extraction of a second strain (CNB- 
477), which was found to produce these compounds in higher yield, followed by a 
series of chromatography steps, led to the isolation of the heterocyclic macrolides 
sporolides A (5) and B (6), which differ only in the position of the chlorine substituent 
on the aromatic ring. These compounds possess unprecedented carbon skeletons and 
appear to be formed from two independently produced polyketide precursors. Both 
compounds were inactive in our cytotoxicity and drug resistant antimicrobial screens, 
and possessed insufficient activity in the NCI 60-cell panel (GI 50 24.5 and 49.0 uM, 
respectively) to be considered for further action. Given the fact that they are chlori- 
nated and possess epoxide functionalities, the lack of activity is somewhat surprising, 
and further testing may be warranted. 
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Thus far from one "Salinospora" strain (CNB-392), we have isolated two new se- 
ries of molecules (the salinosporamides, compounds 1 and 3, and the sporolides, com- 
pounds 5 and 6) and are currently working on the structure elucidation of other new 
bioactive compounds, including a series of polyketide-derived antibiotics, from the 
extracts of additional "Salinospora" strains. In general, "Salinospora" extracts dis- 
play an extraordinary level of biological activity, with >50% producing crude organic 
extracts with HCT-1 16 IC 50 values of <10 ug/mL. In some cases, the observed extract 
activities have been ascribed to compounds related to staurosporines and rifamycin, 
two well-known classes of actinomycete cytotoxins. This brings up the point that it is 
unrealistic to expect that new actinomycete taxa will only produce secondary metabo- 
lites with previously undescribed carbon skeletons. The logic for seeking out new taxa 
is the belief that they will have a greater likelihood of producing new compounds, as 
has been the case with "Salinospora " species, not the unrealistic hope that all of the 
molecules that they produce will be new. 

Other research groups worldwide have also been investigating marine-derived acti- 
nomycetes, and some, such as Okami's group at the Institute of Microbial Chemistry in 
Tokyo, were among the first to explore the chemical potential of these bacteria and 
should be recognized for their pioneering work (25). As was our experience, many 
early efforts did not emphasize taxonomically unique marine actinomycetes or those 
that were specifically adapted to life in the ocean, and, as a result, many known com- 
pounds or new derivatives of known skeletons were obtained. Given our growing under- 
standing of marine actinomycete diversity, along with improvements in our ability to 
culture these bacteria, we feel confident that future chemical studies, targeting specific 
marine species, will begin to reveal the true chemical potential of these bacteria. 

Although our goal here is not to summarize the findings of other research groups, it 
is of particular relevance to discuss a recent discovery by the Wyeth- Ayerst group (for 
a review of this program, see ref. 26). As part of their effort to investigate the potential 
microbial origin of the potent antitumor agent namenamicin, originally isolated from 
extracts of the marine ascidian Polysyncraton lithostrotum, the Wyeth group cultivated 
what appeared to be a new Micromonospora sp. from tissues of P. lithostrotum col- 
lected off Fiji (27). In culture, this halophilic strain did not produce namenamicin; 
however, culture extracts displayed potent DNA-damaging activity, and bioassay- 
guided fractionation led to the isolation of the new dimeric diazobenzofluorene glyco- 
side antitumor antibiotics lomaivitacins A (7) and B (8). The authors were kind enough 
to provide us with the SSU rRNA sequence data for the producing strain, and it clearly 
falls within the "Salinospora" clade. Thus, as has been our experience, strains belong- 
ing to the "Salinospora" clade can be isolated from the tissues of marine invertebrates 
as well as from marine sediments. The Wyeth- Ay erst discovery adds a third new 
chemotype to the diverse repertoire of secondary metabolites produced by this new 
group of marine actinomycetes. 

2.3. "Marinophilus" 

Given the high rate of new compound production by "Salinospora" strains, we ini- 
tiated a study to cultivate new "Salinospora " species as well as to determine whether 
additional marine actinomycete taxa could be recovered from marine samples (28). 
Using the requirement of seawater for growth as a selective criterion, we quickly rec- 
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ognized a second new phylotype, for which we hope to propose the generic epithet 
"Marinophilus" (Fig. 1). Unlike "Salinospora" strains, "Marinophilus" are not limited 
to tropical waters. They are also, unfortunately, unlike "Salinospora" in that they can- 
not be recovered in large numbers from marine sediments using any of our current 
isolation methods. Although we have cultured relatively few "Marinophilus" strains 
thus far, they do appear to have one thing in common with "Salinospora" spp. — the 
ability to produce unique secondary metabolites. 

Initial chemical studies of the "Marinphilus" strains in our collection quickly led to 
the discovery of a series of structurally unique antitumor- antibiotics that we have called 
marinomycins A-D ([9-12], Fig. 3). The structure elucidation of these compounds 
proved difficult due to the symmetry of the molecules and their many stereo centers; 
however, these issues have now been resolved and the compounds will soon be submit- 
ted for publication (29). Although the marinomycins are polyene-like macrolides, they 
do not possess the antifungal activities typically associated with many polyene metabo- 
lites. They do, however, display potent cytotoxicities in the NCI 60-cell panel. Of the 
three compounds thus far tested (compounds 9-11), the mean GI 50 values were 18.6 nM, 
12.6 nM, and 5.1 nM, respectively. Possibly of greater significance, all three com- 
pounds demonstrated a four-log range in LC 50 values between the least and most sensi- 
tive cell lines, with a pronounced spike in activity against one specific melanoma. These 
compounds are currently being subjected to additional testing at the NCI. We hope in 
the future to develop improved cultivation techniques for "Marinophilus" spp. and to 
further explore the chemical potential of this new actinomycete taxon. 

2.4. Marinone Producers 

Although we now possess a better understanding of the taxonomic uniqueness of the 
marine-derived actinomycetes we are cultivating, this was not always the case, and, in 
the early days of our program, we routinely relied on broad chemical and biological 
screening in our search for new metabolites. For example, in 1989, we isolated an 
actinomycete (strain CNB-632) from the Torrey Pines Estuary in La Jolla, CA. Chemi- 
cal studies of this strain led to the isolation of two new meroterpenoid antibiotics, 
marinone (compound 13) and debromomarinone (compound 14), examples of rare bio- 
synthetic hybrids of mixed polyketide-terpenoid origin (30). These compounds share a 
novel sequiterpene napthoquinone carbon skeleton and are most closely related to the 
actinomycete-derived monoterpene napthaquinones naphterpin (31) and the naphth- 
geranines (32). They also possess antibiotic activities against Gram-positive bacteria, 
with MIC values ranging from 2 to 4 \yM. 

Four years later, while collecting sediment samples from the Batiquitos Lagoon, 
Carlsbad, CA, approx 50 km north of the original collection site, we isolated another 
actinomycete strain (CNH-099), which produced not only compounds 13 and 14 but 
also several new derivates (15, 16) in the marinone series as well as the new metabolite 
neomarinone (compound 17 [33]), the structure of which was subsequently revised 
(34). While marinone and its analogs (compounds 13-16) contain a nonrearranged ses- 
quiterpene residue, neomarinone (compound 17) features a rearranged C-15 substitu- 
ent that occurs on the nonquinone side of the molecule. Feeding experiments with 
13 C-labeled acetate by Moore's group at the University of Arizona indicated that, like 
in naphterpin, the naphthoquinone portions of both marinone and neomarinone were 
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Fig. 3. Actinomycete secondary metabolites. 
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derived from a symmetrical pentaketide intermediate and thus are of polyketide origin. 
Unlike the monoterpene portion of naphterpin however, 13 C-labeled glucose feeding 
studies revealed that the sesquiterpene portions of compounds 14 and 17 are not de- 
rived from the mevalonate pathway but instead appear to originate from the recently 
described nonmevalonate terpene biosynthetic pathway in a manner similar to the 
related Streptomyces antibiotic furaquinocin (34). 

Subsequent phylogenetic analyses of the SSU rRNA genes of the marinone-produc- 
ing strains CNB-632 and CNH-099 revealed that they are closely related to each other 
but are significantly different from any known members of the Streptomycetaceae 
(Fig. 1). These strains, along with our strain CNQ-766, form a new marine clade 
within the family and appear to represent a new marine taxon. Thus, chemical studies 
leading to a highly unusual class of secondary metabolites inadvertently led to the 
discovery of a new actinomycete taxon, supporting our contention that there is a corre- 
lation between taxonomic and chemical novelty. 

Interestingly, neither CNB-632 nor CNH-099 requires seawater for growth. Although 
we have used this criterion reliably as a rapid method to recognize unique marine 
phylotypes, it is becoming increasingly clear that not all marine strains possess this 
trait (see Subheading 2.5.)- It is also of interest that related strains producing similar 
compounds were isolated over a 4-yr period. This has been our experience with other 
marine phylotypes, e.g., "Salinospora, " and is inconsistent with the concept that these 
actinomycetes are merely random strains originating as run-off from terrestrial sources. 
It also raises questions about biogeographical distributions and the ecological signifi- 
cance of compounds whose production appears to be maintained over time. 

2.5. Seawater Requirements 

The observation that non-seawater-requiring marine-derived strains can be phyloge- 
netically unique suggests that the requirement of seawater for growth, although clearly 
a marine adaptation, should not be considered a defining factor used to delineate 
marine from nonmarine actinomycetes. Marine and terrestrial environments are 
merely habitats from which actinomycetes can be recovered, and there is no reason to 
assume that these boundaries, as we perceive them, are as clearly delineated in the 
microbial world. As to what defines a marine actinomycete, it may not be wise to 
speculate on this subject, as any seemingly reasonable definition has a high probability 
of one day proving incomplete. It may be better instead to seek evidence that marine- 
derived actinomycetes are metabolically active or capable of growth in the marine envi- 
ronment. Other useful characteristics include the display of specific marine adaptations, 
the formation of associations with plants or invertebrates, and the demonstration of a 
role in nutrient cycling, all of which provide an indication that these bacteria are not 
merely present as dormant spores. When coupled with phylogenetic novelty, evidence 
for any of the above traits provides a strong case that the specific populations under 
study can be considered marine. 

In the case of actinomycetes, the persistence with which the requirement of seawater 
for growth is maintained for any specific population remains unclear, as does the time- 
scale within which this trait can be lost or acquired. In the case of "Salinospora " strains, 
they have thus far without exception maintained this physiological requirement even 
after more than 10 yr in culture. "Marinomyces" strains however appear to vary in this 
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trait, as only a portion of those initially isolated required seawater for growth and it is 
not yet clear whether they maintain this requirement after prolonged laboratory cul- 
ture. Thus, a better understanding of the role of sodium in actinomycete physiology, 
and the time frame within which a requirement of sodium for growth can be acquired, 
is warranted if we are to understand the evolutionary significance of this physiological 
trait. 

3. Discussion 

Although much excitement has been generated over the application of combinato- 
rial biology to the heterologous expression of biosynthetic genes harvested directly 
from nature, these methods remain in the exploratory stage and at present are no substi- 
tution for the direct cultivation of chemically prolific taxa. Although efforts to culture 
marine actinomycetes are only beginning, preliminary studies have been highly pro- 
ductive, leading to the isolation of new marine taxa and a high rate of new-compound 
discovery. Old concepts that the vast majority of environmental bacteria cannot be 
cultured are being rapidly disproved (e.g., ref. 35), and there can be little doubt that 
with the continued development of new culture techniques, untold new actinomycete 
diversity will become apparent. Continued cultivation efforts, along with culture- 
independent studies, will clarify how effectively total actinomycete diversity is being 
accessed in the search for new microbial products. 

We may now be witnessing the first glimpse of a major resurgence in natural prod- 
ucts as a source of new pharmaceutical agents. Chemical and Engineering News de- 
voted its October 2003 issue to this concept, speaking in detail about "rediscovering 
natural products" and the potential role of microorganisms in this process. Clearly, 
actinomycetes have the potential to once again play a major role in microbial drug 
discovery, and the recent observation that significant populations of marine-adapted 
actinomycetes occur in ocean sediments certainly emphasizes the potential importance 
of these strains in future discovery efforts. Given the advances made in analytical tech- 
niques, coupled with the potential for lead optimization using combinatorial synthesis 
and recombinant pathway expression, these tools now make it possible to capitalize on 
microbial products with an efficiency that was heretofore unimaginable. 
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Summary 

Endophytic microorganisms are found in virtually every higher plant on earth. These organisms 
reside in the living tissues of the host plant and do so in a variety of relationships, ranging from 
symbiotic to pathogenic. Endophytes may contribute to their host plant by producing a plethora of 
substances that provide protection and survival value to the plant. Ultimately, these compounds, once 
isolated and characterized, may also have potential for use in modern medicine. Novel antibiotics, 
antimycotics, immunosuppressants, and anticancer compounds are only a few examples of what has 
been found after the isolation and culturing of individual endophytes followed by purification and 
characterization of some of their natural products. The potential of finding new drugs that may be 
effective candidates for treating newly developing diseases in humans is great. 

Key Words: Munumbicins; kakadumycin; taxol; volatile antibiotics; streptomycetes; antican- 
cer agents; immunosuppressants. 

1. Introduction 

The need for new and useful compounds to provide assistance and relief in all aspects 
of the human condition is ever growing. Drug resistance in bacteria, the appearance of 
new life-threatening viruses, the recurrent problems of diseases in persons with organ 
transplants, and the tremendous increase in the incidence of fungal infections in the 
world's population all underscore our inadequacy to cope with these medical prob- 
lems. Environmental degradation, loss of biodiversity, and spoilage of land and water 
also add to problems facing humanity, and each of these in turn can have health-related 
consequences. 

Endophytes, microorganisms that reside in the tissues of living plants, are relatively 
unstudied as potential sources of novel natural products for exploitation in medicine. 
However, some of the most extensive and comprehensive work on natural products 
produced by endophytes has been done on the Neotyphodium sp. found on grasses (1). 
Alkaloids synthesized by this fungus in its grass hosts have been implicated in fescue 
toxicosis in rangeland animals (1). The chemistry and biology of this and other grass 
endophytes are reviewed elsewhere (2). Unfortunately, because this work is so com- 
prehensive, one may be led to the conclusion that endophytes produce toxic compounds 
only in their respective hosts and hold no promise for any medicinal applications what- 
soever (2). It turns out that this is simply not the case. As endophytes are examined 
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from a plethora of sources, an overwhelming number have been found to produce natu- 
ral products with promising potential for medicinal applications. 

Of the approx 300,000 higher plant species that exist on the earth, each individual 
plant, of the billions that exist here, is host to one or more endophytes. Only a handful 
of these plants (grass species) have ever been completely studied relative to their endo- 
phytic biology (2). Consequently, the opportunity to find new and interesting endo- 
phytic microorganisms among myriads of plants in different settings and ecosystems is 
very great. The intent of this review is to provide insights into their occurrence in 
nature, the products that they make, and indicate how some of these organisms are 
beginning to show some potential for human use. The majority of the report discusses 
rationale for study, methods used, and examples of a number of endophytes isolated 
and studied in the authors' laboratories over the course of many years. This review, 
however, also includes some specific examples that illustrate the work of others in this 
emerging field of bioprospecting the microbes of the world's rainforests. 

2. Needs for New Natural Products 

There is a general call for new antibiotics, and for chemotherapeutic agents that are 
highly effective and possess low toxicity. This search is driven by the development of 
resistance in infectious microorganisms (e.g., Staphylococcus, Mycobacterium, Strep- 
tococcus) to existing drugs and by the menacing presence of naturally resistant organ- 
isms. The ingress to the human population of new disease-causing agents such as 
acquired immunodeficiency syndrome (AIDS), Ebola, and severe acute respiratory syn- 
drome (SARS) requires the discovery and development of new drugs to combat them. 
Not only do diseases such as AIDS require drugs that target them specifically, but new 
therapies are needed for treating ancillary infections, which are a consequence of a 
weakened immune system. Furthermore, others who are immunocompromised (e.g., 
cancer and organ transplant patients) are at risk of infection by opportunistic patho- 
gens, such as Aspergillus, Cryptococcus, and Candida, which normally are not major 
problems in the human population. In addition, more drugs are needed to efficiently 
treat parasitic protozoan and nematodal infections such as malaria, leishmaniasis, 
trypanomiasis, and filariasis. Malaria by itself is more effective in claiming lives each 
year than any other single infectious agent with the exception of AIDS and tuberculosis 
(TB) (3). However, the enteric diseases claim the most lives each year of any disease 
complex, and unfortunately, the victims are mostly children (3). 

Novel natural products and the organisms that make them offer opportunities for 
innovation in drug discovery. Exciting possibilities exist for those who are willing to 
venture into the wild and unexplored territories of the world to experience the thrill of 
engaging in the discovery of endophytes, their biology, and potential usefulness. 

3. Endophytic Microbes 

It may also be true that a reduction in interest in natural products for use in drug 
development has happened as a result of people growing weary of dealing with the 
traditional sources of bioactive compounds, including plants of the temperate zones 
and microbes from a plethora of soil samples gathered in different parts of the world by 
armies of collectors. In other words, why continue to do the same thing when robots, 
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combinatorial chemistry, and molecular biology have arrived on the scene? Further- 
more, the logic and rationale for time and effort spent on drug discovery using a target- 
site-directed approach has been overwhelming. 

While combinatorial synthesis produces compounds at random, secondary metabo- 
lites, defined as low-molecular-weight compounds not required for growth in pure cul- 
ture, are produced as an adaptation for specific functions in nature (4). Shutz notes that 
certain microbial metabolites seem to be characteristic of certain biotopes, both on an 
environmental as well as organismal level (5). Accordingly, it appears that the search 
for novel secondary metabolites should center on organisms that inhabit unique 
biotopes. Thus, it behooves the investigator to carefully study and select the biological 
source before proceeding, rather than to take a totally random approach in selecting the 
source material. Careful study also indicates that organisms and their biotopes that are 
subjected to constant metabolic and environmental interactions should produce even 
more secondary metabolites (5). Endophytes are microbes that inhabit such biotopes, 
namely higher plants, which is why they are currently considered as a wellspring of 
novel secondary metabolites offering the potential for exploitation of their medical 
benefits. 

In addition, it also is extremely helpful for the investigator interested in exploiting 
endophytes to have access to, or have some expertise in, microbial taxonomy, and this 
includes modern molecular techniques involving sequence analyses of 16S and 18S 
rDNA. Currently, endophytes are viewed as an outstanding source of bioactive natural 
products because there are so many of them occupying literally millions of unique 
biological niches (higher plants) growing in so many unusual environments. Thus, it 
would appear that a myriad of biotypical factors associated with plants can be impor- 
tant in the selection of a plant for study. It may be the case that these factors may 
govern which microbes are present in the plant as well as the biological activity of the 
products associated with these organisms. 

Since the discovery of endophytes in Darnel, Germany, in 1904, various investiga- 
tors have defined endophytes in different ways, which usually depended on the per- 
spective from which the endophytes were being isolated and subsequently examined 
(6). Bacon et al. give an inclusive and widely accepted definition of endophytes: 
"Microbes that colonize living, internal tissues of plants without causing any imme- 
diate, overt negative effects" (2). While the symptomless nature of endophyte occupa- 
tion in plant tissue has prompted focus on symbiotic or mutualistic relationships 
between endophytes and their hosts, the observed biodiversity of endophytes suggests 
they can also be aggressive saprophytes or opportunistic pathogens. Both fungi and 
bacteria are the most common microbes existing as endophytes. It would seem that 
other microbial forms most certainly exist in plants as endophytes, such as mycoplas- 
mas, rickettsia, and archebacteria; however, no evidence for them has yet been pre- 
sented. The most frequently isolated endophytes are the fungi (7). It turns out that the 
vast majority of plants have not been studied for their endophytes. Thus, enormous 
opportunities exist for the recovery of novel fungal forms, including genera, biotypes, 
as well as species in the myriad of plants yet to be studied. Hawksworth and Rossman 
estimated there may be as many as 1 million different fungal species, yet only approx 
100,000 have been described (8). As more evidence accumulates, estimates keep rising 
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as to the actual number of fungal species. For instance, Dreyfuss and Chapela estimate 
there may be at least 1 million species of endophytic fungi alone (9). It seems obvious 
that endophytes are a rich and reliable source of genetic diversity and may represent 
previously undescribed species. Finally, in our experience, novel microbes (as defined 
at the morphological and/or molecular levels) often have novel natural products associ- 
ated with them. This fact alone helps eliminate the problems of dereplication in com- 
pound discovery. 

4. Rationale for Plant Selection 

It is important to understand the methods and rationale used seem to provide the best 
opportunities to isolate novel endophytic microorganisms at the genus, species, or bio- 
type level. Thus, since the number of plant species in the world is so great, creative and 
imaginative strategies must be used to quickly narrow the search for endophytes dis- 
playing bioactivity (10). 

A specific rationale for the collection of each plant for endophyte isolation and natu- 
ral product discovery is used. Several hypotheses govern this plant selection strategy, 
and these are as follows: 

1. Plants from unique environmental settings, especially those with an unusual biology, and 
possessing novel strategies for survival, are seriously considered for study. 

2. Plants that have an ethnobotanical history (use by indigenous peoples) that are related to 
the specific uses or applications of interest are selected for study. These plants are chosen 
either by direct contact with local peoples or via local literature. Ultimately, it may be 
learned that the healing powers of the botanical source, in fact, may have nothing to do 
with the natural products of the plant, but of the endophyte inhabiting the plant. 

3. Plants that are endemic, having an unusual longevity, or that have occupied a certain 
ancient land mass, such as Gondwanaland, are also more likely to lodge endophytes with 
active natural products than other plants. 

4. Plants growing in areas of great biodiversity, it follows, also have the prospect of housing 
endophytes with great biodiversity. 

Just as plants from a distinct environmental setting are considered to be a promising 
source of novel endophytes and their compounds, so too are plants with an unconven- 
tional biology. For example, an aquatic plant, Rhyncholacis penicillata, was collected 
from a river system in southwest Venezuela where the harsh aquatic environment sub- 
jected the plant to constant beating by virtue of rushing waters, debris, and tumbling 
rocks and pebbles (11). These environmental insults created many portals through 
which common phytopathogenic oomycetes could enter the plant. Still, the plant popu- 
lation appeared to be healthy, possibly owing to protection by an endophytic product. 
This was the environmental biological clue used to pick this plant for a comprehensive 
study of its endophytes. Eventually, an unusual and potent antifungal strain of Serratia 
marcescens, living both as an epiphyte and an endophyte, was recovered from 7?. 
penicillata. This bacterium was shown to produce oocydin A, a novel antioomycetous 
compound having the properties of a chlorinated macrocyclic lactone (Fig. 1) (11). It is 
conceivable that the production of oocydin A by S. marcescens is directly related to the 
endophyte 's relationship with its higher-plant host. Currently, oocydin A is being con- 
sidered for agricultural use to control the ever-threatening presence of oomyceteous 
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Fig. 1. Oocydin A, a chlorinated macrocyclic lactone isolated and characterized from a strain 
of Serratia marcescens, obtained from Rhyncholacis penicillata (stereochemistry is not known). 

fungi such as Pythium spp. and Phytophthora spp. Oocydin A also has activity against 
a number of rapidly dividing cancer cell lines (11). 

Plants with ethnobotanical history, as mentioned above, also are likely candidates 
for study, since the medical uses for which the plant was selected may relate more to its 
population of endophytes than to the plant biochemistry itself. For example, a sample 
of the snakevine, Kennedia nigriscans, from the Northern Territory of Australia, was 
selected for study since its sap has traditionally been used as bush medicine for many 
millenia. In fact, this area was selected for plant sampling because it has been home to 
the world's longest standing civilization — the Australian aborigines. The snakevine is 
harvested, crushed, and heated in an aqueous brew by local aborigines in southwest 
Arnhemland to treat cuts, wounds, and infections. As it turned out, the plant contained 
a streptomycete that possessed unique partial 16S rDNA sequences when compared to 
those in GenBank. The organism was designated Streptomyces NRRL 30562, and it 
produces broad-spectrum novel peptide antibiotics called munumbicins, which are dis- 
cussed below (12). It seems likely that some of the healing properties in plants, as 
discovered by indigenous peoples, might be facilitated by compounds produced by one 
or more specific plant-associated endophytes as well as the plant products themselves. 

In addition, it is worthy to note that some plants generating bioactive natural prod- 
ucts have associated endophytes that produce the same natural products. Such is the 
case with taxol, a highly functionalized diterpenoid and famed anticancer agent that is 
found in each of the world's yew tree species (Taxus spp.) (11,12). In 1993, a novel 
taxol-producing fungus, Taxomyces andreanae, from the yew Taxus brevifolia, was 
isolated and characterized (13). 

5. Endophytes and Biodiversity 

Of the myriad of ecosystems on earth, those having the greatest general biodiversity 
seem to be the ones also having the greatest number and most diverse endophytes. 
Tropical and temperate rainforests are the most biologically diverse terrestrial ecosys- 
tems on earth. The most threatened of these spots cover only 1.44% of the land's sur- 
face, yet they harbor over 60% of the world's terrestrial biodiversity (10). In addition, 
each of the 20-25 areas identified as supporting the world's greatest biodiversity sup- 
port unusually high levels of plant endemism (10). As such, one would expect, with 
high plant endemism, there also should exist specific endophytes that may have evolved 
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with the endemic plant species. Biological diversity implies chemical diversity, 
because of the constant chemical innovation that is required to survive in ecosys- 
tems where the evolutionary race to survive is most active. Tropical rainforests are a 
remarkable example of this type of environment. Competition is great, resources are 
limited, and selection pressure is at its peak. This gives rise to a high probability that 
rainforests are a source of novel molecular structures and biologically active com- 
pounds (14). 

Bills et al. describe a metabolic distinction between tropical and temperate endo- 
phytes through statistical data that compare the number of bioactive natural products 
isolated from endophytes of tropical regions to the number of those isolated from endo- 
phytes of temperate origin (15). Not only did they find that tropical endophytes provide 
more active natural products than temperate endophytes, but they also noted that a 
significantly higher number of tropical endophytes produced a larger number of active 
secondary metabolites than did fungi from other substrata. This observation suggests 
the importance of the host plant as well as the ecosystem in influencing the general 
metabolism of endophytic microbes. 

6. Endophytes and Phytochemistry 

Tan and Zou believe the reason why some endophytes produce certain phytochem- 
icals, originally characteristic of the host, might be related to a genetic recombination 
of the endophyte with the host that occurred in evolutionary time (6). This is a concept 
that was originally proposed as a mechanism to explain why T. andreanae may be 
producing taxol (16). Thus, if endophytes can produce the same rare and important 
bioactive compounds as their host plants, this would not only reduce the need to har- 
vest slow-growing and possibly rare plants, but also help to preserve the world's ever- 
diminishing biodiversity. Furthermore, it is recognized that a microbial source of a 
high-value product may be easier and more economical to produce effectively, thereby 
reducing its market price. 

All aspects of the biology and interrelatedness of endophytes with their respective 
hosts is a vastly under-investigated and exciting field (17,18). Thus, more background 
information on a given plant species and its microorganismal biology would be exceed- 
ingly helpful in directing the search for bioactive products. Presently, no one is quite 
certain of the role of endophytes in nature and their relationship to various host plant 
species. Although some endophytic fungi appear to be ubiquitous (e.g., Fusarium spp., 
Pestalotiopsis spp., and Xylaria spp.), one cannot definitively state that endophytes are 
truly host-specific or even systemic within plants, any more than one can assume that 
their associations are chance encounters. Frequently, many endophytes of the same 
species are isolated from the same plant, and only one or a few biotypes of a given 
fungus will produce a highly biologically active compound in culture (19). A great deal 
of uncertainty also exists between what an endophyte produces in culture and what it 
may produce in nature. It does seem possible that the production of certain bioactive 
compounds by the endophyte in situ may facilitate the domination of its biological 
niche within the plant or even provide protection to the plant from harmful invading 
pathogens. Furthermore, little information exists relative to the biochemistry and physi- 
ology of the interactions of the endophyte with its host plant. It would seem that many 
factors changing in the host, related to the season, age, environment, and location, may 
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influence the biology of the endophyte. Indeed, further research at the molecular level 
must be conducted in the field to study endophyte interactions and ecology. All of 
these interactions are probably chemically mediated for some purpose in nature. An 
ecological awareness of the role these organisms play in nature will provide the best 
clues for targeting particular types of endophytic bioactivity with the greatest potential 
for bioprospecting. 

7. Collection, Isolation, and Preservation of Endophytes 

After a plant is selected for study, it is identified, and its location is plotted using a 
global positioning device. Small stem pieces are cut from the plant and placed in sealed 
plastic bags after excess moisture is removed. Every attempt is made to store the mate- 
rials at 4°C until isolation procedures can begin (20,21). 

In the laboratory, the surfaces of plant materials are thoroughly treated with 70% 
ethanol, sometimes flamed, and ultimately they are air dried under a laminar-flow hood. 
This is done in order to eliminate surface-contaminating microbes (20). Then, with a 
sterile knife blade, outer tissues are removed from the samples and the inner tissues 
carefully excised and placed on water agar plates. After several days of incubation, 
hyphal tips of the fungi are removed and transferred to potato dextrose or other suitable 
agar. Bacterial forms also emerge from the plant tissues, including, on rare occasions, 
certain Streptomyces spp. The endophytes are encouraged to sporulate on specific plant 
materials and are eventually identified via standard morphological and molecular bio- 
logical techniques and methods. Eventually, when an endophyte is acquired in pure 
culture, it is tested for its ability to be grown in shake or still culture using various 
media and growth conditions (21). It is also immediately placed in storage under vari- 
ous conditions including 15% glycerol at -70°C. Ultimately, once appropriate growth 
conditions are found, the microbe is subjected to fermentation, extraction, and the 
bioactive compounds are isolated and characterized. Virtually all of the common and 
advanced procedures for product isolation and characterization are utilized in order to 
acquire the product(s) of interest. Central to the processes of isolation is the establish- 
ment of one or more bioassays that will guide the compound purification processes. 
One cannot put too much emphasis on this point, since the ultimate success of any 
natural-product isolation activity is directly related to the development or selection of 
appropriate bioassay procedures. These can involve target organisms, enzymes, tis- 
sues, or model chemical systems that relate to the purpose for which the new com- 
pound is needed. 

8. Natural Products From Endophytic Microbes 

The following section shows some examples of natural products obtained from endo- 
phytic microbes and their potential in the pharmaceutical and agrochemical arenas. Many 
of the examples are taken from our work, and thus, this review is by no means inclusive 
of all natural-product work in endophytes. 

8.1. Endophytic Fungal Products As Antibiotics 

Fungi are the most commonly isolated endophytic microbes. They usually appear as 
fine filaments growing from the plant material on the agar surface. Generally, the most 
commonly isolated fungi are in the group Fungi imperfecti or Deuteromycetes. Basi- 
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cally, they produce asexual spores in or on various fruiting structures. Also, it is quite 
common to isolate endophytes that are producing no fruiting structures whatsoever, 
such as Mycelia sterilia. Quite commonly endophytes do produce secondary metabo- 
lites when placed in culture. However, the temperature, the composition of the medium, 
and the degree of aeration will affect the amount and kind of compounds that are pro- 
duced. Sometimes endophytic fungi produce antibiotics. Natural products from endo- 
phytic fungi have been observed to inhibit or kill a wide variety of harmful 
microorganisms including, but not limited to, phytopathogens, as well as bacteria, 
fungi, viruses, and protozoans that affect humans and animals. Described below are 
some examples of bioactive products from endophytic fungi. 

Cryptosporiopsis cf. quercina is the imperfect stage of Pezicula cinnamomea, a fun- 
gus commonly associated with hardwood species in Europe. It was isolated as an endo- 
phyte from Tripterigeum wilfordii, a medicinal plant native to Eurasia (22). On Petri 
plates, C quercina demonstrated excellent antifungal activity against some important 
human fungal pathogens, including Candida albicans and Trichophyton spp. A unique 
peptide antimycotic, termed "cryptocandin," was isolated and characterized (22). This 
compound contains a number of peculiar hydoxylated amino acids and a novel amino 
acid, 3-hydroxy-4-hydroxy methyl proline (Fig. 2). The bioactive compound is related 
to known antimycotics — the echinocandins and the pneumocandins (23). As is gener- 
ally true, not one but several bioactive and related compounds are produced by an 
endophytic microbe. Thus, other antifungal agents related to cryptocandin are also pro- 
duced by C. quercina. Cryptocandin is also active against a number of plant pathogenic 
fungi, including Sclerotinia sclerotiorum and Botrytis cinerea. Cryptocandin and its 
related compounds are currently being considered for use against a number of fungi 
causing diseases of the skin and nails. 

Cryptocin, a unique tetramic acid, is also produced by C. quercina (discussed previ- 
ously) (Fig. 3)(24). This unusual compound possesses potent activity against 
Pyricularia oryzae, the causal organism of one of the worst plant diseases in the world, 
as well as a number of other plant pathogenic fungi (24). The compound was generally 
ineffective against a general array of human pathogenic fungi. Nevertheless, with mini- 
mum inhibitory concentrations against P. oryzae at 0.39 ug/mL, this compound is being 
examined as a natural chemical control agent for rice blast and is being used as a platform 
for the synthesis of other antifungal compounds. 

As mentioned earlier, P. microspora is a common rainforest endophyte (17-20). It 
turns out that enormous biochemical diversity does exist in this endophytic fungus, and 
many secondary metabolites are produced by various strains of this widely dispersed 
organism. One such secondary metabolite is ambuic acid, an antifungal agent, which 
has been recently described from several isolates of P. microspora found as representa- 
tive isolates in many of the world's rainforests (Fig. 4) (25). This compound as well as 
another endophyte product, terrein, have been used as models to develop new solid- 
state nuclear magnetic resonance (NMR) tensor methods to assist in the characteriza- 
tion of the molecular stereochemistry of organic molecules. 

A strain of P. microspora was also isolated from the endangered tree Torreya 
taxifolia and produced several compounds having antifungal activity, including 
pestaloside, an aromatic (3-glucoside (Fig. 5), and two pyrones — pestalopyrone and 
hydroxypestalopyrone (26). These products also possess phytotoxic properties. Other 



Rainforest Endophytes 



337 



OH 



Glutamine 



OH 



HO 

3 -Hydroxy-4-hydroxy 
methyl-proline 



3 ,4-Dihydroxy-homotryosine 



4-Hydroxyproline 
OH 




Threonine 



OH 

4,5-Dihydroxy 
ornithine : 



NH 



(CH 2 ). 



14 
Palmitic acid 



Fig. 2. Cryptocandin A, an antifungal lipopeptide obtained from the endophytic fungus 
Cryptosporiopsis cf. quercina (no stereochemistry is intended). 




Fig. 3. Cryptocin, a tetramic acid antifungal compound found in Cryptosporiopsis cf. 
quercina. 
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Fig. 4. Ambuic acid, a highly functionalized cyclohexenone produced by a number of isolates 
of Pestcilotiopsis microspom found in rainforests around the world. This compound possesses 
antifungal activity and has been used as a model compound for the development of solid-state 
nuclear magnetic resonance methods for the structural determination of natural products. 
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Fig. 5. Pestaloside, a glucosylated aromatic compound with antifungal properties from 
Pestalotiopsis microspora. 



newly isolated secondary products obtained from P. microspora (endophytic on Taxus 
brevifolia) include two new caryophyllene sesquiterpenes — pestalotiopsins A and B 
(27). Additional new sesquiterpenes produced by this fungus are 2oc-hydroxydimeninol 
and a highly functionalized humulane (28,29). Variation in the amount and kinds of 
products found with this fungus depends on both the cultural conditions as well as the 
original plant source from which it was isolated. 

Pestalotiopsis jesteri is a newly described endophytic fungal species from the 
Sepik River area of Papua New Guinea, and it produces jesterone and hydroxy- 
jesterone, which exhibit antifungal activity against a variety of plant pathogenic fungi 
(30). These compounds are highly functionalized cyclohexenone epoxides. Jesterone, 
subsequently, has been prepared by organic synthesis with complete retention of bio- 
logical activity (Fig. 6) (31). Jesterone is one of only a few products from endophytic 
microbes in which total synthesis of a bioactive product has been successfully 
accomplished. 
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Fig. 6. Jesterone, a cyclohexenone epoxide from Pestciliotiopsis jesteri, has antioomycete 
activity. 



Phomopsichalasin, a metabolite from an endophytic Phomopsis sp., represents the 
first cytochalasin-type compound with a three-ring system replacing the cytochalasin 
macrolide ring. This metabolite exhibits antibacterial activity in disk diffusion assays 
(at a concentration of 4 ug/disk) against Bacillus subtilis, Salmonella gallinarum, and 
Staphylococcus aureus. It also displays moderate activity against the yeast Candida 
tropicalis (32). 

An endophytic Fusarium sp. from the plant Selaginella pallescens, collected in the 
Guanacaste Conservation Area of Costa Rica, was screened for antifungal activity. A 
new pentaketide antifungal agent, CR377, was isolated from the culture broth of the 
fungus and showed potent activity against C. albicans in agar diffusion assays (33). 

Colletotric acid, a metabolite of Colletotrichum gloeosporioides, an endophytic fun- 
gus isolated from Artemisia mongolica, displays activity against bacteria as well as 
against the fungus Helminthsporium sativum (34). Another Colletotrichum sp., iso- 
lated from Artemisia annua, produces antimicrobial metabolites as well. A. annua is a 
traditional Chinese herb that is well recognized for its synthesis of artemisinin (an 
antimalarial drug) and its ability to inhabit many geographically different areas. Not 
only did the Colletotrichum sp. found in A. annua produce metabolites with activity 
against human pathogenic fungi and bacteria, but also metabolites that were fungistatic 
to plant pathogenic fungi (35). 

8.2. Endophytic Bacterial Products As Antibiotics 

There are only a limited number of bacterial species known to be associated with 
plants, and one of the most common is Pseudomonas spp. Pseudomonas has represen- 
tative biotypes and species that are epiphytic, endophytic, and pathogenic. They have 
been reported from every continent including the Antarctic. Some of these species pro- 
duce phytotoxic compounds as well as antibiotics. The ecomycins are produced by 
Pseudomonas viridiflava (36). This bacterium is generally associated with the leaves 
of many grass species and is located on and within the tissues (36). The ecomycins 
represent a family of novel lipopeptides and have masses of 1153 and 1181. Besides 
common amino acids such as alanine, serine, threonine, and glycine, some nonprotein 
amino acids are incorporated into the structure of the ecomycins, including homoserine 
and (3-hydroxyaspartic acid (36). The ecomycins are active against such human patho- 
genic fungi as Cryptococcus neoformans and C. albicans. 
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The pseudomycins produced by a plant-associated pseudomonad are another group 
of antifungal peptides (37,38). They are active against a variety of plant and human 
pathogenic fungi, including Candida albicans, Cryptococcus neoformans, and a vari- 
ety of plant pathogenic fungi, including Ceratocystis ulmi (the Dutch Elm disease 
pathogen) and Mycosphaerella fijiensis (causal agent of Black Sigatoka disease in ba- 
nanas). The pseudomycins are cyclic depsipeptides formed by acylation of the OH 
group of the N-terminal serine with the terminal carboxyl group of L-chlorothreonine. 
Variety in this family of compounds is imparted via ,/V-acetylation by one of a series of 
fatty acids, including 3,4-dihydroxydecanoate, 3-hydroxy-tetradecanoate (38). The 
pseudomycins contain several nontraditional amino acids, including L-chlorothreonine, 
L-hydroxyaspartic acid, and both d- and L-diaminobutryic acid. The molecules are candi- 
dates for use in human medicine, especially after structural modification by chemical 
synthesis has successfully eliminated mammalian toxicity (39) The pseudomycins are 
also effective against a number of ascomycetous fungi, and are being considered for 
agricultural use for the control of the Black Sigatoka disease in bananas (Strobel, 
unpublished). 

8.3. Endophytic Streptomycetes As Antibiotic Producers 

Streptomyces spp. are filamentous bacteria, belonging to the order Actinomycetales, 
that live in widely diverse ecological settings. Generally, this group is Gram positive, 
has a high G+C content, and does not have an organized nucleus. To date, actino- 
mycetes have been the world's greatest source of natural antibiotics (40). In fact, just 
one genus, Streptomyces, is the source of 80% of these compounds. The majority of the 
antibiotic producers are from soil sources, and until recently it was not realized that 
these organisms can exist as endophytes. One of the first endophytic Streptomyces spp. 
isolated was that from Lolium perenne, a grass species (41). This isolate produces a 
diketopiperazine that is a weak antibiotic and has been designated "methylalbonoursin" 
(41). 

Using the ethnobotanical approach to plant selection, the snakevine plant, K. 
nigriscans, was chosen as a possible source of endophytic microbes because of its 
long-held traditional use by Australian aborigines to treat cuts and open wounds, result- 
ing in reduced infection and rapid healing. This plant was collected near the Aboriginal 
Community of Manyallaluk in Northern Territory, Australia, and consistently yielded 
an endophytic actinomycete designated Streptomyces NRRL 30562 (12). The organ- 
ism was not found in several tree species supporting the vine, suggesting a host-selec- 
tive or -specific association of the endophyte with a specific plant genus. This 
streptomycete produces a family of extremely potent peptide antibiotics, and these com- 
pounds may not only protect the plant from fungal and bacterial infections, but also 
have unknowingly served the aborigines as a source of bush medicine. 

The antibiotics produced by Streptomyces NRRL 30562, called "munumbicins," 
possess widely differing biological activities, depending on the target organism. In 
general, the munumbicins demonstrate activity against Gram-positive bacteria such as 
Bacillus anthracis and multidrug-resistant Mycobacterium tuberculosis, as well as a 
number of other drug-resistant bacteria. However, the most impressive biological activ- 
ity of any of the munumbicins is that of munumbicin D against the malarial parasite 
Plasmodium falciparum, having an IC 50 of 4.5 + 0.07 ng/mL (12). The munumbicins 
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are highly functionalized peptides, each containing threonine, aspartic acid (or aspar- 
agine), and glutamic acid (or glutamine). Since the peptides are yellowish orange, they 
also contain one or more chromophoric groups, whose structures have not been deter- 
mined. Their masses range from 1269 to 1326 Da. The isolation of this endophytic 
streptomycete represents an important finding, providing one of the first examples of 
plants serving as reservoirs of actinomycetes. More than 40 of these endophytic strep- 
tomycetes, now in hand in our laboratory, possess antibiotic activity (Castillo, U., 
Strobel, G.A., unpublished data). Endophytic actinomycetes are now being tested and 
considered for use in controlling plant diseases (42). 

Another endophytic Streptomyces sp. (NRRL 30566), from a fern-leaved grevillea 
(Grevillea pteridifolia) tree growing in the Northern Territory of Australia, produces 
novel antibiotics called "kakadumycins," which are related to the echinomycins (43). 
Each of these antibiotics contains alanine, serine, and an unknown amino acid. 
Kakadumycin A has wide-spectrum antibiotic activity similar to that of munumbicin 
D, especially against Gram-positive bacteria, and it generally displays better bioactiv- 
ity than echinomycin. For instance, against B. anthracis strains, kakadumycin A has 
MICs of 0.2-0.3 ug/ mL, in contrast to echinomycin at 1.0-1.2 ug/mL. Both 
echinomycin and kakadumycin A have impressive activity against P. falciparum, with 
LD 50 s in the range of 7-10 ng/mL (43). Kakadumycin A and echinomycin are related 
by virtue of their very similar structures (amino acid content and quinoxoline rings), 
but differ slightly with respect to their elemental compositions, aspects of their spectral 
qualities, chromatographic retention times, and biological activities (53). 

Echinomycin and kakadymycin A were studied as inhibitors of macromolecular syn- 
thesis, with control substances such as ciprofloxacin, rifampin, chloramphenicol, and 
vancomycin used as standards with well-established modes of action. Tests were done 
for DNA, RNA, protein, and cell-wall synthesis inhibition activities, respectively. 
Kakadumycin A significantly inhibited RNA synthesis in B. subtilis (43). Kakadumycin 
A also inhibited protein synthesis and cell-wall synthesis substantially, but had a lower 
effect on DNA synthesis. Kakadumycin A shares a very similar inhibitory profile with 
echinomycin in four macromolecular synthesis assays. Kakadumycin A preferentially 
inhibits RNA synthesis, and may have the same mode of action as echinomycin, which 
inhibits RNA synthesis by binding to a DNA template (53). 

More recently, endophytic streptomycetes have been discovered in an area of the 
world claimed to be one of the most biologically diverse — the upper Amazon of Peru. 
The inner tissues of the follow me vine, Monstera sp., commonly yielded a verticillated 
streptomycete with outstanding inhibitory activities against pythiaceous fungi as well 
as the malarial parasite Plasmodium falciparum. The bioactive component is a mixture 
of lipopeptides named "coronamycins" (44). 

8.4. Antiviral Compounds 

Another fascinating use of products from endophytic fungi is the inhibition of vi- 
ruses. Two novel human cytomegalovirus (hCMV) protease inhibitors, cytonic acids A 
and B, have been isolated from solid-state fermentation of the endophytic fungus 
Cytonaema sp. Their structures were elucidated as /?-tridepside isomers by MS and 
NMR methods (45). It is apparent that the potential for the discovery of compounds 
having antiviral activity from endophytes is in its infancy. The main limitation to com- 
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pound discovery to date is probably related to the absence of common antiviral screen- 
ing systems in most compound-discovery programs. 

8.5. Volatile Antibiotics From Endophytes 

Muscodor albus is a newly described endophytic fungus obtained from small limbs 
of Cinnamomum zeylanicum (cinnamon tree) (46). This xylariaceaous (non-spore pro- 
ducing) fungus effectively inhibits and kills certain fungi and bacteria by producing a 
mixture of volatile compounds (47). The majority of these compounds have been iden- 
tified by gas chromatography (GC)/MS, synthesized or acquired, and then formulated 
into an artificial mixture. This mixture not only mimicked the antibiotic effects of the 
volatile compounds produced by the fungus, but also was used to confirm the identity 
of the majority of the volatiles emitted by this organism (47). Each of the five classes of 
volatile compounds produced by the fungus had some microbial effects against the test 
fungi and bacteria, but none was lethal. However, they acted synergistically to cause 
death in a broad range of plant and human pathogenic fungi and bacteria. The most 
effective class of inhibitory compounds was the esters, of which isoamyl acetate was 
the most biologically active. The composition of the medium on which M. albus grows 
dramatically influences the kind of volatile compounds that are produced (48). The 
ecological implications and potential practical benefits of the "mycofumigation" effects 
of M. albus are very promising, given the fact that soil fumigation utilizing methyl bro- 
mide will soon be illegal in the United States. Methyl bromide is not only a hazard to 
human health, but it has been implicated in causing destruction of the ozone layer. The 
potential use of mycofumigation to treat soil, seeds, and plants may soon be a reality. 
The artificial mixture of volatile compounds may also have usefulness in treating seeds, 
fruits, and plant parts in storage and while being transported. Muscodor albus already 
has a limited market for the treatment of human wastes. Its gases have both inhibitory 
and lethal effects on such fecal-inhabiting organisms as Escherichia coli and Vibrio 
cholera. 

Using M. albus as a screening tool, it has now been possible to isolate other endo- 
phytic fungi producing volatile antibiotics. The newly described M. roseus was obtained 
twice from tree species growing in the Northern Territory of Australia. This fungus is 
just as effective in causing inhibition and death of test microbes in the laboratory as M. 
albus (49). In addition, for the first time, a nonmuscodor species (Gliocladium sp.) was 
discovered as a producer of volatile antibiotics. The volatile components of this organ- 
ism are totally different from those of either M. albus or M. roseus. In fact, the most 
abundant volatile inhibitor is [8]-annulene, formerly used as a rocket fuel and discov- 
ered for the first time as a natural product. However, the bioactivity of the volatiles of 
this Gliocladium sp. is not as good or comprehensive as those from Muscodor spp. 
(21,47). 

8.6. Endophytic Fungal Products As Anticancer Agents 

Taxol and some of its derivatives represent the first major group of anticancer agents 
that are produced by endophytes (Fig. 6). Taxol (Fig. 7), a highly functionalized 
diterpenoid, is found in each of the world's yew (Taxus) species, but was originally 
isolated from Taxus brevifolia (50). The original targets for this compound were ova- 
rian and breast cancers, but now it is used to treat a number of other human tissue 
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Fig. 7. Taxol, the world's first billion-dollar anticancer drug, is produced by many endo- 
phytic fungi. It too, possesses outstanding anti-oomycete activity. 



proliferating diseases as well. The presence of taxol in yew species prompted the study 
of their endophytes. By the early 1990s, however, no endophytic fungi had been iso- 
lated from any of the world's representative yew species. After several years of effort, 
a novel taxol-producing endophytic fungus, Taxomyces andrecmae, was discovered in 
Taxus brevifolia (13). The most critical line of evidence for the presence of taxol in the 
culture fluids of this fungus was the electrospray mass spectrum of the putative taxol 
isolated from T. andreanae. In electrospray mass spectroscopy, taxol usually gives two 
peaks — one at mass 854, which is M+H + , and the other at 876, which is M+Na + . Fun- 
gal taxol had a mass spectrum identical to authentic taxol (16). Then, 14 C labeling 
studies showed the presence of fungal-derived taxol in the culture medium (26). This 
early work set the stage for a more comprehensive examination of the ability of other 
Taxus species and many other plants to yield endophytes producing taxol. 

Some of the most commonly found endophytes of the world's yews and many other 
plants are Pestalotiopsis spp. (17-20). One of the most frequently isolated endophytic 
species is Pestalotiopsis microspora (17). An examination of the endophytes of Taxus 
wallichiana yielded P. microspora, and a preliminary monoclonal antibody test indi- 
cated that it might produce taxol (20). After preparative TLC, a compound was isolated 
and shown by spectroscopic techniques to be taxol. Labeled ( 14 C) taxol was produced 
by this organism from several 14 C precursors (20). Furthermore, several other P. 
microspora isolates that produce taxol were obtained from a bald cypress tree in South 
Carolina (19). This was the first indication that endophytes, residing in plants other 
than Taxus spp., produce taxol. Therefore, a specific search was conducted for taxol- 
producing endophytes on continents not known for any indigenous Taxus spp., e.g., 
South America and Australia. From the extremely rare, and previously thought to be 
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extinct, Wollemi pine (Wollemia nobilis), Pestalotiopsis guepini was isolated, which 
was shown to produce taxol (51). Also, quite surprisingly, a rubiaceous plant, 
Maguireothamnus speciosus, yielded a novel fungus, Seimatoantlerium tepuiense, that 
produces taxol. This endemic plant grows on the top of the tepuis in the Venzuelan- 
Guyana border in southwest Venezuela (52). Furthermore, fungal taxol production has 
also been noted in Periconia sp. (53) and Seimatoantlerium nepalense, another novel 
endophytic fungal species (54). Simply, it appears that the distribution of taxol-making 
fungi is worldwide and is not confined to endophytes of yews. The ecological and 
physiological explanation for fungi making taxol seems to be related to the fact that 
taxol is a fungicide, and the organisms most sensitive to it are plant pathogens such as 
Pythium spp. and Phytophthora spp. (55). These pythiaceous organisms are some of 
the world's most important plant pathogens and are strong competitors with endophytic 
fungi for niches within plants. In fact, their sensitivity to taxol is based on their interac- 
tion with tubulin, in an identical manner as in rapidly dividing human cancer cells (55). 
Thus, bona fide endophytes may be producing taxol and related taxanes to protect their 
respective host plant from degradation and disease caused by these pathogens. 

Other investigators have also made observations on taxol production by endophytes, 
including the discovery of taxol production by Tubercularia sp. isolated from the Chi- 
nese yew (Taxus mairei) in the Fujian province of southeastern mainland China (56). 
At least three endophytes of Taxus wallichiana produce taxol, including Sporormia 
minima and Trichothecium sp. (57). Using HPLC and ESIMS, taxol has been discov- 
ered in Corylus avellana cv. Gasaway (58). Several fungal endophytes of this plant 
(filbert) produce taxol in culture (58). It is important to note, however, that taxol pro- 
duction by all endophytes in culture is in the range of sub-micrograms to micrograms 
per liter. Also, commonly, the fungi will attenuate taxol production in culture, with 
some possibility for recovery, if certain activator compounds are added to the medium 
(53). Efforts are being made to determine the feasibility of making microbial taxol a 
commercial possibility, e.g., the discovery of endophytes that make large quantities of 
one or more taxanes that could then be used as intermediates for the organic synthesis 
of taxol or one of its anticancer relatives. 

Torreyanic acid, a selectively cytotoxic quinone dimer and potential anticancer 
agent, was isolated from a P. microspora strain (Fig. 8). This strain was originally 
obtained as an endophyte associated with the endangered tree Torreya taxifolia (Florida 
torreya) (59). Torreyanic acid was tested in several cancer cell lines, and it demon- 
strated 5 to 10 times more potent cytotoxicity in lines that are sensitive to protein 
kinase C agonists; it causes cell death by apoptosis. Recently, torreyanic acid has 
been successfully synthesized by a biomimetic oxidation/dimerization cascade (60). 

Alkaloids are also commonly found in endophytic fungi. Fungal genera such as 
Xylaria, Phoma, Hypoxylon, and Chalara are representative producers of a relatively 
large group of substances known as the cytochalasins, of which more than 20 are now 
known. Many of these compounds possess antitumor and antibiotic activities, but 
because of their cellular toxicity they have not been developed into pharmaceuticals. 
Three novel cytochalasins have recently been reported from Rhinocladiella sp., as an 
endophyte on Tripterygium wilfordii. These compounds have antitumor activity and 
have been identified as 22-oxa-[12]-cytochalasins (61). Thus, it is not uncommon to 
find one or more cytochalasins in endophytic fungi, and this provides an example of 
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Fig. 8. Torreyanic acid, an anticancer compound, from Pestalotiopsis microspora. 



the fact that redundancy in discovery does occur, making dereplication an issue even 
for these under-investigated sources. 

8.7. Products From Endophytes As Antioxidants 

Two compounds, pestacin and isopestacin, have been obtained from culture fluids 
of Pestalotiopsis microspora, an endophyte isolated from a combretaceaous plant, 
Terminalia morobensis, growing in the Sepik River drainage system of Papua New 
Guinea (62,63). Both pestacin and isopestacin display antimicrobial as well as antioxi- 
dant activity. Isopestacin was attributed with antioxidant activity based on its struc- 
tural similarity to the flavonoids (Fig. 9). Electron spin resonance spectroscopy 
confirmed this antioxidant activity; the compound is able to scavenge superoxide and 
hydroxyl free radicals in solution (62). Pestacin was later described from the same 
culture fluid, occurring naturally as a racemic mixture and also possessing potent anti- 
oxidant activity (Fig. 10) (63). The proposed antioxidant activity of pestacin arises 
primarily via cleavage of an unusually reactive C-H bond and, to a lesser extent, through 
O-H abstraction (63). The antioxidant activity of pestacin is at least one order of mag- 
nitude more potent than that of trolox, a vitamin E derivative (63). 

8.8. Antidiabetic Agents From Rainforest Fungi 

A nonpeptidal fungal metabolite (L-783,281) was isolated from an endophytic fun- 
gus (Pseudomassaria sp.) collected from an African rainforest near Kinshasa in the 
Democratic Republic of the Congo (64). This compound acts as an insulin mimetic but, 
unlike insulin, is not destroyed in the digestive tract and may be given orally. Oral 
administration of L-783,281 in two mouse models of diabetes resulted in significant 
lowering of blood glucose levels. These results may lead to new therapies for diabetes. 

8.9. Immunosuppressive Compounds From Endophytes 

Immunosuppressive drugs are used today to prevent allograft rejection in transplant 
patients, and in the future they could be used to treat autoimmune diseases such as 
rheumatoid arthritis and insulin-dependent diabetes. The endophytic fungus Fusarium 
subglutinans, isolated from T. wilfordii, produces the immunosuppressive but 
noncytotoxic diterpene pyrones subglutinols A and B (Fig. 11) (65). Subglutinol A and 
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Fig. 9. Isopestacin, an antioxidant produced by an endophytic Pestalotiopsis microspora 
strain, isolated from Terminalia morobensis growing on the north coast of Papua New Guinea. 




Fig. 10. Pestacin is also produced by Pestalotiopsis microspora, and it too is an antioxidant. 



B are equipotent in the mixed lymphocyte reaction (MLR) assay and thymocyte pro- 
liferation (TP) assay, with an IC 50 of 0.1 uM. In the same assay systems, the famed 
immunosuppressant drug cyclosporin A, also a fungal metabolite, was roughly as potent 
in the MLR assay and 10 4 more potent in the TP assay. Still, the lack of toxicity asso- 
ciated with subglutinols A and B suggests that they should be explored in greater detail 
as potential immunosuppressants (65). 

9. Surprising Results From Molecular Biological Studies 
on Pestalotiopsis microspora 

Of some compelling interest is an explanation as to how the genes for taxol produc- 
tion may have been acquired by P. microspora (66). Although the complete answer to 
this question is not at hand, relevant genetic studies have been performed on this organ- 
ism. P. microspora Ne 32 is one of the most easily genetically transformable fungi that 
have been studied to date. In vivo addition of telomeric repeats to foreign DNA gener- 
ates extrachromosomal DNAs in this fungus (66). Repeats of the telomeric sequence 
5'-TTAGGG-3' were appended to nontelomeric transforming DNA termini. The new 
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Fig. 11. Subglutinol A, an immunosuppressant, is produced by an endophytic Fusarium 
subglutinans strain. 



DNAs, carrying foreign genes and the telomeric repeats, replicated independently of 
the chromosome and expressed the information carried by the foreign genes. The addi- 
tion of telomeric repeats to foreign DNA is unusual among fungi. This finding may 
have important implications in the biology of P. microspora Ne 32, because it explains 
at least one mechanism as to how new DNA can be captured by this organism and 
eventually expressed and replicated. Such a mechanism may begin to explain how the 
enormous biochemical variation may have arisen in this fungus (19). Also, this initial 
work represents a framework to aid in the understanding of how this fungus may adapt 
itself to the environment of its plant hosts and suggests that the uptake of plant DNA into 
its own genome may occur. In addition, the telomeric repeats have the same sequence as 
human telomeres, and this points to the possibility that P. microspora may serve as a 
means to make artificial human chromosomes, a totally unexpected result. 

10. Conclusion 

Endophytes are a poorly investigated group of microorganisms that represent an 
abundant and dependable source of bioactive and chemically novel compounds with 
potential for exploitation in a wide variety of medical applications. The mechanisms 
through which endophytes exist and respond to their surroundings must be better 
understood in order to be more predictive about which higher plants to seek, study, 
and employ in isolating microfloral components. This may facilitate the natural-prod- 
uct discovery process. 

Although work on the utilization of this vast resource of poorly understood microor- 
ganisms has just begun, it has already become obvious that an enormous potential for 
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organism, product, and utilitarian discovery in this field holds exciting promise. This is 
evidenced by the discovery of a wide range of products, and microorganisms that 
present potential. It is important for all involved in this work to realize the importance 
of acquiring the necessary permits from governmental, local, and other sources to pick 
and transport plant materials (especially from abroad) from which endophytes are to be 
eventually isolated. In addition to this aspect of the work is the added activity of pro- 
ducing the necessary agreements and financial sharing arrangements with indigenous 
peoples or governments in case a product does develop an income stream. 

Certainly, one of the major problems facing the future of endophyte biology and 
natural-product discovery is the rapidly diminishing rainforests, which hold the great- 
est possible resource for acquiring novel microorganisms and their products. The total 
land mass of the world that currently supports rainforests is about equal to the area of 
the United States (10). Each year, an area the size of Vermont or greater is lost to 
clearing, harvesting, fire, agricultural development, mining, or other human-oriented 
activities (10). Presently, it is estimated that only a small fraction (10-20%) of what 
were the original rainforests existing 1000-2000 yr ago, are currently present on the 
earth (10). The advent of major negative pressures on them from these human-related 
activities appears to be eliminating entire mega-life forms at an alarming rate. Few 
have ever expressed information or opinions about what is happening to the potential 
loss of microbial diversity as entire plant species disappear. It can only be guessed that 
this loss is also happening, perhaps at the same frequency as the loss of mega-life 
forms, especially because certain microorganisms may have developed unique specific 
symbiotic relationships with their plant hosts. Thus, when a plant species disappears, 
so too does its entire suite of associated endophytes and consequently all of the capa- 
bilities that they might possess to make natural products with medicinal potential. 
Multistep processes are needed now to secure information and life forms before they 
are lost. Areas of the planet that represent unique places housing biodiversity need 
immediate preservation. Countries need to establish information bases of their 
biodiversity and at the same time begin to make national collections of microorgan- 
isms that live in these areas. Endophytes are only one example of a life-form source 
that holds enormous promise to impact many aspects of human existence. The problem 
of the loss of biodiversity should be one of concern to the entire world. 
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Summary 

Harvesting and further developing the traditional herbal remedies in Ecuador (THME) is a very 
promising area from various perspectives that might be considered and subject to serious analysis. 
The coincidence of the richest biodiversity in South America confined within a small geographical 
area presents inherent scientific, pragmatic, and logistical advantages; moreover, Ecuador at this time 
uses the US dollar as its only currency. This implies that any economic planning can be done by 
investors using the widest known monetary instrument. Another important development taking place 
in this country is the effort presently under way to implement proper legal measures to protect foreign 
investment. At any rate, the endeavor of harvesting and developing THM requires in Ecuador, as in 
any other fragile location, an integral approach to avoid serious mistakes seen in the past. If such 
conditions are met, it is very likely that the participating individuals will be conscientious enough to 
place the interests of collectivity ahead of their own and protect, as their own, the very source that 
generates any potential earnings: a healthy and rationally used environment. 

Key Words: Traditional herbal remedies in Ecuador (THME); drug discovery; plant extracts; 
Ecuador. 

1. Introduction 

The knowledge that particular plants have therapeutic value often stems from tradi- 
tional knowledge and wisdom, which is frequently the true source of information for 
phytochemical investigation. This ancient knowledge is also the basis for further iden- 
tification of active components and, in many cases, the foundation for the development 
of new drugs. In fact, according to experts from the Food and Agricultural Organiza- 
tion (FAO) of the United Nations, 25% of prescriptions written by physicians in the 
United States and Europe are for compounds derived from plant extracts (I). These are 
obtained by what appears to be genuine discovery, but should be more accurately seen 
as clever development derived from observations made on the widespread traditional 
practices of ancient medicine. 

Selection of plants used in traditional practice for the treatment of common illnesses 
not only suggests a course of treatment to follow for specific pathologies, but also, and 
more importantly, provides a potential source of mixed active ingredients that may be 
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useful for drug development. As a matter of fact, the use of medicinal plants as raw 
material for the extraction of pharmacologically active components, as well as of pre- 
cursors for chemo-pharmaceutical semi-synthesis, is a common procedure utilized in 
the pharmaceutical industry; moreover, one-third of existing pharmaceutical products 
are derived from either plants or their components (2). 

The above-mentioned facts, taken together with the reality that the vast majority of 
plant species from the tropical rainforest have not yet been described or classified, 
highlights the need for further investigation, drug development, and, especially, imple- 
mentation of regulatory measures. Exploring this potential, with a focus on biological 
and chemical properties, is a wide-open research field with the promise of more potent, 
specific, and less toxic drugs. 

As an additional and important consideration, several authors have addressed the 
issue of the economic importance of exploring the qualities of natural products (3). 
This is supported by the fact that nearly half of the top-selling drugs in the world are 
either original plant or microbial compounds or their derivatives; moreover, the size of 
the plant-derived products market is approx $18 billon USD in the United States alone, 
underscoring the major importance of the worldwide business of plant-derived phar- 
maceuticals. As a matter of fact, international sales of these products have been esti- 
mated to be at least $200 billion USD, with an expanding perspective of growth for the 
coming years (4). 

The present issues about the use and commercialization of medicinal plants must 
lead to sustainable development strategies that potentially integrate with community 
participation and involvement, in order to create socially and environmentally sound 
businesses that protect, rather than harm, nature. This ideal situation can be achieved 
through international collaboration to design intelligent self-sustainable programs and, 
at the same time, with commitment to execute them in a responsible manner. Environ- 
mental law enforcement, locally and internationally, is the cornerstone to guarantee the 
proper use and not abuse, of natural resources. 

In the last 30 yr, the methodology for separation of active compounds has been 
dramatically improved through methods of analytical separation, structural elucida- 
tion, and quantitative analysis (5). The characterization, in minute detail, of complex 
chemical compounds can now be accomplished with relative ease, high speed, and 
reasonable cost. The design and use of potent assays has also achieved modern stan- 
dards and comprises manually efficient, semi-automatic, and fully automated systems 
that allow detection of small compounds proven to be active in a wide range of primary 
biological activities. 

Although the focus on production of synthetic drugs by the pharmaceutical industry 
has until now somehow limited the use of medicinal plants, the industry is again turn- 
ing to natural products, therefore creating the basis for a new opportunity for plant- 
derived compounds. In fact, renewed interest in this field has increased substantially in 
the past few years, emphasizing not only the existing technical feasibility for use and 
industrial production, but also the important economic potential underlying this matter. 

This chapter will discuss the advantages of biodiversity and the possibilities of cre- 
ating a profitable enterprise while developing sustainable strategies that protect 
biodiversity. 
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2. Biological Considerations 

2. 1 Preserve and Explore the Biodiversity Without Endangering 
the Environment 

Geographical locations, as well as topographical, edaphic, and climatic conditions have 
resulted in the magnificent ecological mosaic displayed in Ecuadorian biodiversity. These 
extensive, complex, and inherently rich environmental conditions have also generated 
an impressive diversity of natural ecosystems in which different species and varieties 
of plants and animals have successfully evolved, adapted, and flourished (6). 

Variations in altitude, humidity, pressure, and precipitation in each zone are reflected 
in the always-variable vegetation coverage. An entire range of examples of this can be 
appreciated in a small area; in fact, in Ecuador we observe a dramatic change in ecosys- 
tems, from locations with annual grass-dominated deserts, which depend on occasional 
rain, to very humid tropical forests, including large conglomerates of ancient trees that 
depend on high precipitation throughout the year, and all of them confined within a 
restricted geographical location (7). These small examples of very different regions 
that are generated by a simple gradient in the occidental region within a very small 
area, emphasize the peculiarities of the country. The Andean foothills also possess 
these gradients, and they generate natural wonders that range from the typical humid 
forests to herbaceous-dominated bleak plateaus, and deserts in the high mountains (8). 

Ecuador also possesses a great variety of aquatic ecosystems, including marine, 
coastal, and insular environments, as well as fluvial and continental lagoon areas. These 
ecosystems shelter a great diversity of organisms, many of which are of economic 
importance. Although information about diversity at the flora and fauna species level is 
scarce, dispersed, and heterogeneous, preliminary data confirm the existence of an enor- 
mous biological richness (9). 

According to Steere (10), Ecuador is the country with the largest number of plant 
species per unit area in South America. Its flora encompasses approx 20,000 to 25,000 
species of vascular plants, many of them unique, and with an estimated endemic value of 
20% based on local flora distribution patterns. In the same way, the diversity of 
nonvascular plants is very high, although no estimate of the total number of species 
existing in the country is available. These realities highlight the need for investment in 
science and technology to explore a largely untouched world full of promise. Further- 
more, investigations of the genetic diversity of natural plants of Ecuador reveal the 
presence of an extraordinary diversity in phytogenetic resources that might be impor- 
tant for improving the production of unique natural foods and for diversifying cultiva- 
tion (11). 

Myers (12) identified 10 areas that harbor somewhere between 30 and 40% of the 
world's biological diversity, but some of the locations are the most threatened by 
human activities. These areas, characterized by an exceptional concentration of flora 
and fauna and with a large percentage of endemic species, are known in the environ- 
mental sciences as "hot zones" or "hot spots," and comprise 1% of the surface of the 
planet. In spite of its very small surface, Ecuador contains three of these zones which 
are the very humid tropical forests of the occidental coast, the external flanks of the 
Andean-range mountain chain, and the Amazon forests of the northeastern area. The 
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protection and preservation of these areas is considered to be one of the top priorities 
for the conservation of biological diversity at the global level. In this regard, we are 
witnessing, in the last few years, what seems to be independent events such as varia- 
tions in weather, alteration in rain patterns, ozone-layer deterioration, polar cap melt- 
ing, and other environmental tragedies. It is clear now that all of these are related to the 
nonrational use and abuse of natural resources and destruction of nature (13). Eco- 
nomic interests with no consideration for damage to nature have greatly surpassed the 
capability of nature to restore its losses, and have led to relentless destruction. Simi- 
larly, and among other factors, the destruction of habitats, the overexploitation of 
resources, the introduction of nonnative species, and environmental contamination 
are leading to the disappearance of Ecuador's flora and fauna species. This progressive 
detriment of the habitat is also leading to the irreparable loss of valuable genetic infor- 
mation stored in wild species and biodiversity. 

Countless species have already disappeared, and several hundred more are in danger 
of extinction due to the destruction of occidental forests. An isolated but sad example 
of this fact is the Dicliptera dodsoni species, endangered as a result of the conversion 
of very humid forests of the coast into banana and African-palm plantations, as well as 
into land for pasture (14). Some other species, like Tabebuia chrysantha (Guayacan), 
which is the most valued wood in dry tropical forests of the coastal area, have greatly 
diminished due to indiscriminate exploitation (14). The list of destroyed or perma- 
nently damaged plant species is enormous and beyond the scope of this chapter; how- 
ever, it emphasizes the need for more rational and intelligent development procedures, 
associated with stronger policies specifically designed to protect extremely fragile 
resources. In similar considerations, the displacement of native crops as well as the new 
agricultural practices are causing the accelerated disappearance of genetic resources stored 
in native cultivated species ( 15). Indeed, modern agriculture is promoting the replacement 
of indigenous species and varieties with new species and biotechnologically modified 
varieties. Even if we consider that under some conditions these newly developed or 
modified plant species are "more economically efficient" and indeed could raise the 
production of individual parcels, the unification of crops may result in massive losses 
of general harvests due to plagues, drought, or frosts induced by severe alterations in 
the environment caused by modern technologies. This vicious circle can be found in 
almost every location of Ecuador; moreover, it has greatly contributed to the already 
massive destruction of large areas of the Ecuadorian rainforest (16). 

2.2. Bio-Prospecting for Natural Resources in Developing Countries 

Bio-prospecting for natural resources comprises the study and collection of biologi- 
cal specimens from a certain location, aiming to discover biological species with com- 
mercial or industrial use (17). One of the oldest and best-known examples is that of 
quinine, extracted from the "chinchona" or "cascarilla," a common plant of the Andes 
(10). This life-saving plant was studied and quinine was extracted and used for thera- 
peutic purposes worldwide. Some people from the drug industry, fortunately not all, 
claim that uneducated people, such as shamans, have never found a pure compound, 
and that is true. However, it is also true that the scientist would never have found many 
compounds without ancient knowledge, including the hundreds of years of empirical 
experimentation with plants. 
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In Ecuador, recently established companies such as Shaman International demon- 
strate slight improvements over previous practices (18). This firm specializes in inves- 
tigating traditional medical knowledge to enhance the possibilities of finding promising 
active ingredients. Shaman's investigations in Ecuador and Peru yielded two active- 
ingredient patents from Croton lechlieri; the patents belong exclusively to the com- 
pany. This entity continues funding the production of large quantities of raw material; 
however, it is expected that this production will generate jobs for people within the 
area and thereby benefit the local economy. Through their foundation, the Healing 
Forest Conservancy, Shaman Pharmaceuticals has signed contracts with several Ecua- 
dorian and Peruvian communities in order to cultivate Croton lechlieri and buy from 
them the corresponding raw materials, generating jobs while saving taxes because of 
the legal advantages that foundations have in Ecuador. These types of new approaches 
have to be evaluated and regulated by authorities and promoted if proven beneficial to 
the country. 

At any rate, it is important to acknowledge that any new plantations must be devel- 
oped with great caution, since single-crop farming might cause further damage to an 
already altered environment. It is also essential to analyze whether these newly devel- 
oped plantations will eventually replace primary forests or other subsistence planta- 
tions. If that is the case, a sound plan for regional development and substitution of 
primary forests must first be developed. 

2.3. An Optimistic Scenario 

Aggressive bio-prospecting efforts and leakage of genetic resources out of 
biodiversity-rich countries, such as Ecuador, are problems that should be addressed 
through the clear commitment of scientists who have in mind the importance of the 
development of the therapeutic potential of a large variety of Ecuadorian plant species. 
Moreover, the rational design of research protocols, aiming to test both safety and 
efficacy of promising Ecuadorian plant species, will certainly lead to sound proposals 
for the international community and for the industry. The fact that scientific studies can 
be done more economically in Ecuador than in other countries is an advantageous fea- 
ture that might attract potential investors and companies. Nowadays, the existing thera- 
peutic leads, as well as the already proven uses of plant extracts, are provided not only 
by ancestral knowledge but are also being developed by local scientists. International 
cooperation and the industry will find a fertile soil for their specific aims, be they 
conservation, science, education, self-sustainable development, or profit. 

2.4. Economical Considerations of Biodiversity 

The assignment of an economic value to biodiversity is not an easy task; neverthe- 
less, a few examples can be used to obtain an estimate of the economical potential of 
this resource. One of the best evaluations was performed by Simpson, Sedjo, and Reid 
(17), who concluded that the pharmaceutical industry is willing to invest more for con- 
servation of the habitat in Ecuador than in any other part of the world. They showed 
that the industry will not only greatly benefit in general, but indeed that ultimately it 
will save money if makes a minimum annual investment of $20 USD per hectare to 
implement preventive measurements against deforestation. Calculations have been 
made, e.g., those of Adger et al. (19), that estimate an annual average value of $22.4 
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USD per hectare for biological conservation of the forests of Mexico. Using param- 
eters such as those used by these authors, but considering that the biodiversity index of 
the high Amazonian basin (Ecuador) is much higher (20), the corresponding estimate 
for Ecuador is also much elevated. 

Biodiversity poses an enormous economic potential and can produce innumerable 
benefits by serving not only as the origin of raw material but also as the source of 
miscellaneous ingredients for the chemical industry, drug production, and other related 
activities. For this reason, prospecting biodiversity and its commercial application have 
gained special attention (21). Collaboration between nations and the pharmaceutical 
industry have met with success; the most well-known example is the agreement between 
the INBio (3) in Costa Rica and Merck Ltd. Unfortunately, Ecuador and most other coun- 
tries in Latin America have not yet fully developed and benefited from their biological 
potential. It is clear that during the next few years, cooperative efforts of this kind will 
flourish more broadly. Therefore, it is necessary to establish means that support and 
regulate bio-prospecting to make it profitable and allow the use of biodiversity in a 
sustainable manner. A country such as Ecuador, rich in germoplasm and traditional 
knowledge, might well be a suitable partner for participation within the borders of the 
ever-expanding pharmaceutical market. Several studies of medicinal plants have only 
just begun to describe the natural product potential of Ecuador's biodiversity. In this 
sense, Vogel (22) referred mainly to two studies — that of Ruiz (23), who published a 
list of the 43 species of the Andean region with 2.7 medicinal properties per species; 
and that of Mena (24), who documented the useful therapeutic plants in the low north- 
occidental area, listing 63 species with a median of 2.6 medicinal properties per spe- 
cies. After eliminating duplication of species in the two regions, Vogel calculated a 
total of 256 medicinal properties. 

The multiplication of this number by known current values ($466.00 USD), accord- 
ing to Aylward calculus (25), and by 2500 (the 25% bioactivity calculus in agreement 
with Balick (26), and by 7.5 (the efficient bonus rate of 15%), results in an expected 
potential value of this list of traditional plant medicines in the range of $2.24 billion 
USD (8). Ecuador must increase its efforts to obtain a larger share of this promising 
natural-products market. 

2.5. Alternatives for Development That Preserve the Environment: 
Sustainable Development 

In its classic form, the concept of sustainable development has been described as 
"The development that meets the needs of the present without compromising the abil- 
ity of future generations to meet their own needs" (27). However, we are daily witness- 
ing the dramatic consequences of the destruction of the environment, including 
dramatic changes in weather, rain patterns, and the disappearance of species. 

Simple solutions, such as those of cultivation of the desired plant species as an alter- 
native to forest degradation, can be easily implemented. In this respect, an edaphic 
assessment of the land and rational use of it can be scientifically ascertained and the 
most appropriate type of cultivation selected; therefore, efficient productivity can be 
achieved without destruction or major alteration of primary forest areas (14). In accor- 
dance with this general philosophy, community participation should be utilized — for 
example, by combining production with traditional agricultural practices that lead to 
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achievement of an appropriate balance in agro-forestry, therefore providing additional 
economic income to those communities who wish to collaborate. In this model, local 
communities can provide raw materials in addition to maintaining their traditionally 
grown crops. These simple measures will not only create jobs and economic benefits 
for those communities, but will also serve to dignify, without charity, the lives of these 
presently impoverished individuals. 

3. Medicinal Plants in Drug Development 

3. 1. Phytotherapeutics As an Alternative System of Primary Medicine 

In developing countries, the use of medicinal plants is highly controversial because 
of lack of knowledge and poor understanding of the contextual circumstances by both 
the public and the authorities responsible for regulating the use of these types of medi- 
cines. In fact, there is an erroneous political attitude supposedly aimed to avoid of- 
fenses to the owners of ancestral knowledge, considered to be a discrete minority. In 
reality, the ministries of health of developing countries display a dual and contradicto- 
rily political behavior that never takes a clear and univocal attitude towards proper 
regulation; moreover, these authorities prefer, most of the time, to simply ignore the 
widespread nonjudicious use of botanicals that leads to, in many cases, public and 
specific health dangers and even catastrophes. For instance, and without any objective 
action being taken from the regulators, it is not uncommon to observe the unaffected 
practice of some witch-doctors who advise the public against the use of vaccines, and 
their replacement with herbal teas and ineffective and dangerous measures. 

It is true that an ethno-pharmacological tradition exists; however, many recent 
entrepreneurs of botanicals have created a vast, frequent, and indiscriminate use of 
plants and plant extracts by millions of people, generating not only health dangers but 
also permanent destruction of large areas of the precious and irreplaceable primary 
rainforest. In multiple instances, the real therapeutic potential of these resources is 
assumed and promoted, but there is little if any systematic scientific proof of both 
efficacy and safety. The determination of the efficacy of medicinal plants and their 
therapeutic usefulness is of primary importance for therapy as well as for safety. This is 
critically important nowadays because of the proliferation of small, unregulated firms 
that promote and market the use of hundreds of poorly studied natural remedies. In 
fact, the majority of plants used in folk medicine, and phytomedicines in general, are 
traditionally sold over the counter, and lack adequate pharmacological, toxicological, 
and clinical evaluation (28). In this context, self-medication and uncontrolled use of 
homemade remedies are undeniable facts that sooner or later will generate health dete- 
rioration. It must be remembered that, in contrast to wealthy communities, the use of 
these products in developing regions is not an alternative to other effective medicines, 
but indeed the only option for primary health care (5). Therefore, implementation of 
good practices for production, development, and use of these compounds, as well as 
their scientific evaluation and separation of toxic products, are of primary importance 
to public health. Recent history has shown that the tools needed to discover, character- 
ize, and fully study a pharmaceutical remedy can be effectively and properly used un- 
der cooperative programs between academicians and the pharmaceutical industry; 
similar measures for proper pharmaceutical development of botanicals need to be 
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implemented. As an example, the previously mentioned discovery of the medical use 
of quinine by ancient aborigines and further characterization of quinine by the industry 
clearly epitomizes a strategy that has been successfully used in recent decades (10). 
We believe that the correct approach is for all parties to share and combine knowledge 
and technology to obtain new compounds for pharmaceutical use and profitability, both 
for the pharmaceutical industry as well as for the rightful owners of the initial discov- 
ery. In this fair fashion, new remedies obtained from the use of traditional practices 
and technologically advanced measures, within the framework of an ethically correct 
approach, might prove to be a helpful strategy to solve the present health crisis in 
underdeveloped countries. Moreover, these actions will have additional beneficial 
effects, both by their inherent action and also by indirect mechanisms, such as the 
generation of jobs. Taken together, these aspects and perspectives suggest that devel- 
opment of national programs to integrate the different partners of this large endeavor 
would be highly desirable; moreover, governments must also participate in the proper 
regulation of safety, production, quality control, and marketing of botanicals. 

It must be emphasized that a significant percentage of the Latin American popula- 
tion does not have access to proper medical care. It is believed that fully one-half of the 
people in this part of the continent have limited access, or no access at all, to the most 
needed medicines and drugs. The direct impact of these statistics falls on children and 
lactating mothers. In 1980, for example, the worldwide consumption of medicines 
reached approx $80 billion USD, of which only $6.4 billion (8%) were used in Latin 
America (21). 

In addition, the increased control of the local market by foreign enterprises, the low 
percentage of local investment, and negligible participation of governmental entities in 
drug supply have also influenced the shift in preference by the poorest and neediest 
sectors of the population toward traditional medicine and unvalidated natural products. 
It is believed, as ascertained by officials of the Institute for International Cooperation 
and Development, that as much as 70% of the population use plants as their sole source 
of remedies for miscellaneous illnesses in this part of the world (29). The fact that the 
vast majority of these products have no controlled proof of efficacy, no studies done on 
their safety profile, and are used by an extremely fragile part of the population speaks 
to the dire situation in Latin America in terms of human health and social development. 

In response to these circumstances, the World Health Organization (WHO), in the 
context of its Resolution WHA 31.33, has recognized the importance of medicinal 
plants for primary healthcare in this part of the planet and has recommended to its 
Member States the use of a comprehensive approach to medicinal plants (30). Among 
others, the following recommendations have been made for State Members: 

• A periodic update of the inventory and classification of medicinal plants used in different 
countries. 

• The establishment and development of scientific criteria and methods to ensure the qual- 
ity of medicinal plant preparations as well as the study of their efficacy in the treatment of 
specific conditions and illnesses. 

• The development of international standards and specifications for identification, purity, 
potency and good manufacturing practices. 

• The development of a manual of methodologies for the effective and secure use of phy to- 
therapeutic products by health professionals. 
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• The distribution of all related information to Member States. 

• The allocation of resources by local governments for research and training of human re- 
sources for the study of medicinal plants. 

• The initiation of global programs for the identification, evaluation, preparation, culture, 
controlled growth, and conservation of medicinal plants used in traditional medicine. 

In addition, and as discussed previously, the market for plant-based medicines in 
industrialized countries is very important and displays an ever-expanding pattern. 
Plants are also commonly used by the pharmaceutical industry as raw materials for 
many of its products. In the last decade, the market for botanicals was estimated in the 
range of $35 billion USD per year (3), with a considerable increase during the last five 
years. 

In summary, medicinal plants are used in every country as both a raw material and in 
the form of plant extracts, as semi-purified forms, as an inexpensive source of active 
compounds for chemical or semi-synthetic procedures, and for other miscellaneous 
uses. The large increase in the use of medicinal plants for primary healthcare in Latin 
America, as well as the unusually high interest in naturally derived medications in 
developed countries, constitute a very promising opportunity not only for scientists 
and researchers in the pharmaceutical industry but also for entrepreneurs and investors. 

3.2. Therapeutic Potential of Plants in Ecuador 

In the last few years, the scientific investigation of plants has received wider atten- 
tion. Innumerable plant species have been selected and studied in the search for thera- 
peutic effects and qualities; as a consequence, interesting and novel medicinal 
properties have been described. Many active compounds have been further character- 
ized and are used in the therapeutic armamentarium of modern physicians, who are 
often unaware of the botanical origin of the medications they commonly prescribe. 

A vast variety of botanicals are available as nonprescription as well as prescription 
drugs. In addition to their use in the pharmaceutical industry, plant extracts and their 
derivatives are also used in the cosmetic and other industries, including the food busi- 
ness. Botanists living in Ecuador have only recently begun to explore its rich resources. 
In addition to the Amazon Basin, other important plant resources are found in the 
Ecuadorian Sierra (Andean Region) (4). The following are just a few examples of the 
many hundreds of interesting plant species that can be found growing wild in the vari- 
ous regions of Ecuador: Bidems leuccinta Willd, Bidems pilosa Linneo, Ceratocephalus 
piiosus, Rich, Kerneria duvia Cass, and Kerneria tetrdgona Moench, commonly named 
Morisco, Putso, or Shiran. The last-cited plant is described as having hypotensive, 
hypoglycemic, and antiviral activities. It contains compounds proven to be phototoxic 
to bacteria and fungi (31). It is common knowledge that among its traditional uses, its 
infusion produces diuretic effects, and its topical application, as an ointment, helps to 
relieve pain. Many old people chew the plant leaves to alleviate their rheumatic pain, 
apparently with good results. Notably, in many instances, it might be assumed that its 
use is preferred because of its low cost; however, in some locations, people with proper 
income and access to expensive drugs still prefer to use these leaves instead of sophis- 
ticated anti-inflammatory pharmaceutical compounds, due to negligible side effects. 

Culcitium reflexion HBK, Lasiocephalus ovayus Schlecht, commonly named 
Arquitecta, has traditional medicinal uses as a diuretic and a "purifier or depurative." It 
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has also been used in the treatment of syphilis (31). Xantium catharticum HBK, com- 
monly named Cashamarucha, has been traditionally used in folk medicine as a diuretic 
and purgative. It is a popular and widespread belief that women should not use this 
plant, as witch-doctors warn that it also possesses abortive properties. Chuquiragua 
insignes, commonly named Chuquiragua, have a traditional medicinal use against 
hepatic problems and also as a diuretic, tonic, and for relief of persistent coughs. An 
infusion of its flowers is commonly recommended (32). 

Bacharis riparia HBK, Bacharis polyanta, Bacharis latifolia HBK, commonly 
named Chilca, Chilca negra, Chiza, or Yana chilca, has a traditional use as an infusion 
to alleviate diarrhea in children (31). This plant is commonly used throughout the 
Andean region of Ecuador. Needless to say, this remedy is the only alternative in some 
impoverished locations of this region. Senecio vaccinoides Kunth, commonly named 
Cubilan, Ayalongo, or Cubillm, is used as a potent analgesic when taken as an infusion 
(31). The more intense the pain, the more concentrated the infusion must be. 
Althernanthera pamculata, commonly named Moradilla, is used as an infusion to alle- 
viate renal and hepatic diseases. Franseria artemisoides Willd, commonly named 
Marco, Altamisa, or Marcu, is commonly used as a topical agent to alleviate pain. The 
flowers of this plant are used for the preparation. It is also employed as a flea repellent 
(32). Aristiguietia glutinoso, Eupatorium glutinosum HBK, commonly named Matico, 
Soldier herb, or Chuzalongo, is one of the most popular remedies in the Ecuadorian 
Sierra (Andean region). Its infusion is traditionally used to cure gastritis and ulcers. A 
preparation of the steamed or cooked leaves is used to alleviate rheumatic pain, renal 
disease, and to clean external wounds. 

It should be mentioned that some preliminary phytochemical analyses of several of 
the aforementioned plant species have been carried out. Investigation of Schistoscarpa 
aff euphat oroides and Grias neubertii McBride has revealed the presence of numerous 
terpenoid structures that might indicate an inherent antimycotic activity (33). In addi- 
tion, the presence of sesquiterpene hydrocarbons among its structural components sup- 
ports the popular use of this plant as an insecticide. In addition, the antibacterial and 
antimycotic activity of Minguartia guianensis Aubl. and Clavija procera have been 
studied by Villacres and collaborators (34). 

With regard to other Ecuadorian medicinal plants, the anti-inflammatory activities 
of several species have been studied in some detail by using various in vivo models 
(35), most frequently of the so-called carragenin-induced edema (36). These models 
allow the documentation, on a qualitative and semi-quantitative basis, of the relative 
anti-inflammatory capabilities of plant extracts. The determination of the range of activ- 
ity and not of the mechanism underlying the anti-inflammatory process by these methods 
is a limiting technological problem that needs further assessment and refinement to 
obtain the active principles responsible for the observed effects. At any rate, these 
models are extremely useful, as they allow an objective evaluation of a discrete activity 
that can be, at least, a starting point in the elucidation of the active compounds capable 
of interfering in the genesis of the inflammatory process. In fact, carragenin-induced 
rat-plantar edema, as described by Wintern (37), is a very sensitive model that allows 
evaluation of anti-inflammatory substances. Using this methodology, the authors stud- 
ied Wistar rats distributed in homogeneous groups of 10 animals each (35). Extracts, 
positive and negative controls, and pattern batches were tested using this model. The 
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Table 1 

Relative Anti-Inflammatory Activity of Aqueous Plant Extracts 



Investigated substance 


3 h (%) 


5 h (%) 


7 h (%) 


Phenylbutazone 


59.32 


51.67 


62.96 


Baccharis trinervis 


-11.87 


10.83 


60.49 


Baccharis teindalensis 


-25.42 


-15.00 


56.79 


Eupatorium glutinosum 


25.42 


18.33 


43.21 


Eupatorium articulatum 


37.29 


41.67 


49.38 


Tagetes pusilla 


16.95 


18.33 


20.99 


Conyza floribunda 


-16.95 


-8.33 


62.96 


Neurolaena lobata 


-5.08 


20.00 


58.02 


Urena lobata 


8.47 


21.67 


60.49 


Croton draco 


-8.47 


-8.33 


59.26 



Negative values indicate larger plantar edema than that seen in the control animals 
(these received only water ad libitum). 



results obtained can be seen in Tables 1-6. Briefly, Table 1 shows the anti-inflamma- 
tory activity of various aqueous plant extracts studied with the carragenin-induced 
rat-plantar edema method (reference drug phenylbutazone, a well-known anti-inflam- 
matory agent). Results are expressed as the percent activity of the aqueous plant 
extracts, at different times (3, 5, and 7 h); phenylbutazone had a percent activity of 
59.32, 51.67, and 62.96 at those times. Table 2 and Table 3 show the activities gener- 
ated by the plant extracts obtained by using a dichloro-methanic (Table 2) and by an 
ethanolic (Table 3) extraction method, respectively and at the same times. 

Tables 4-6 show the relative anti-inflammatory activities (RAA) of different plant 
extracts (aqueous, dichloro-methanic, and ethanolic, respectively). The common sta- 
tistical method of the Student's t test was used for all comparisons. In these studies, the 
RAA of all plant extracts was compared to that of phenylbutazone, to which a value of 
1 was putatively ascribed. The statistical analysis included, sequentially, the use of the 
Kolmogorov test to ascertain whether any two data samples were compatible with be- 
ing random sampling of the same, unknown distribution, of all the data generated from 
both the use of the extracts and phenylbutazone as well as from the controls. Thereaf- 
ter, the Student's t test was used to establish the differences between the groups and a 
statistical p value less than 0.05 was set as the observed level of significance. This 
means that any p value similar to or <0.05 or <0.01 found, shown in Tables 3-6, corre- 
sponding to any of the extracts, signifies that this discrete and particular compound has 
an anti-inflammatory activity comparable to or better than phenylbutazone. On the other 
hand, the nonsignificance (abbreviated as ns) of any value generated from a given com- 
pound means that such extract does not possess any anti-inflammatory activity, at least, 
as assessed by this methodology. 

In summary, results from Tables 1-6 lead to the following observations (35): 

1 . When compared with the pattern drug, phenylbutazone, the anti-inflammatory properties 
of all aqueous extracts, with the exception of Eupatorium articulatum, showed a relatively 
low anti-inflammatory activity at 3 and 5 h after administration. At 7 h, these extracts 
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Table 2 

Relative Anti-Inflammatory Activity of Diclorometanic Plant Extracts 



Active substance 


3 h (%) 


5 h (%) 


7 h (%) 


Phenylbutazone 


39.69 


53.22 


93.75 


Baccharis teindalensis 


5.00 


-3.08 


-16.25 


Baccharis trinervis 


9.37 


5.60 


-0.94 


Eupatorium glutinosum 


2.19 


31.93 


62.50 


Tagetes pusilla 


46.88 


31.93 


62.59 


Conyza floribunda 


42.81 


25.21 


37.50 


Neurolaena lobata 


41.56 


36.41 


33.44 


Urena lobata 


33.44 


44.82 


33.44 


Croton draco 


35.31 


28.01 


12.50 


Eupatorium articulatum 


37.50 


52.38 


41.56 



Negative values mean larger plantar edema than that seen in the control animals (these received 
only water ad libitum). 



Table 3 

Relative Anti-Inflammatory Activity of Ethanolic Plant Extracts 



Investigated substance 


3 h (%) 


5 h (%) 


7 h (%) 


Phenylbutazone 


29.13 


50.59 


44.69 


Baccharis trinervis 


12.62 


32.94 


21.79 


Eupatorium glutinosum 


26.21 


38.82 


22.91 


Tagetes pusilla 


40.78 


62.35 


64.26 


Conyza floribunda 


44.57 


36.47 


29.61 


Neurolaena lobata 


0.00 


14.71 


19.55 


Urena lobata 


6.81 


20.59 


21.79 


Eupatorium articulatum 


59.22 


42.39 


53.63 



reach their maximum activity, which was similar to that of phenylbutazone. Six of the 
nine extracts showed a relative activity of 0.9 to 1 in comparison with 1 of the pattern 
drug. 

For the anti-inflammatory activity of the dicloromethanolic extracts, the maximum activ- 
ity was observed after 3 h of administration, with a decrease at 5 h and a further decrement 
at 7 h. Interestingly, six of the nine extracts showed their maximum relative activity at 3 h, 
with a range between 0.84 and 1.18 with respect to 1 of the pattern drug phenylbutazone. 
The ethanolic plant extracts were the most active: various degrees of important relative 
activity were documented at 3, 5, and 7 h. Surprisingly, the relative activity of some of 
these extracts was superior to that of the pattern drug at various times. The plants with the 
highest relative activities were Tagetes pusilla (Asteraceae), with 1.4, 1.23, and 1.44, and 
Eupatorium articulatum (Asteraceae), with 2.03, 0.84, and 1.20, at 3, 5, and 7 h, respec- 
tively, and Conyza floribunda, with 1.53 at 3 h, as compared with 1 of phenylbutazone. 
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Table 4 

Relative Anti-Inflammatory Activity (aa) of Aqueous Plant Extracts 

As Compared With Phenylbutazone (aa = 1.0) 





Relative anti 


-inflammatory activity 


Statistical 


analysis (stu 


dent's t test) 




RAA 


RAA 


RAA 




Plant species 


at3h 


at 5 h 


at7h 


3h 


5h 


7h 


Croton draco 


0.00 


0.00 


0.94 


ns 


ns 


p < 0.01 


Tagetes pusilla 


0.29 


0.36 


0.33 


p < 0.05 


ns 


p<0.01 


E. glutinosum 


0.43 


0.36 


0.69 


ns 


ns 


p<0.01 


Baccharis teindalensis 


0.00 


0.00 


0.90 


ns 


ns 


p<0M 


Baccharis trinervis 


0.00 


0.21 


0.96 


ns 


ns 


p<0.0l 


N. lovata 


0.00 


0.39 


0.92 


ns 


ns 


p<0M 


Conyza floribunda 


0.00 


0.00 


1.00 


ns 


ns 


p<0.0l 


Urena lobata 


0.14 


0.42 


0.96 


ns 


ns 


p<0.01 


Eupatorium articulatum 


0.63 


0.81 


0.68 


ns 


p<0.01 


ns 



Ap value less than 0.05 (p < 0.05) is considered statistically significant. 
RAA, relative anti-inflammatory activity; ns, not statistically significant 



Table 5 

Relative Anti-Inflammatory Activity (aa) of Dicloromethanic Plant Extracts 

As Compared With Phenylbutazone (aa = 1.0) 

Relative anti-inflammatory activity Statistical analysis (student's t test) 

RAA RAA RAA 

Plant species at3h at5h at7h 3h 5h 7h 

Croton draco 0.89 0.53 0.13 p < 0.01 p < 0.05 ns 

Tagetes pusilla 1.18 0.60 0.67 p < 0.01 p < 0.05 p < 0.01 

Eupatorium glutinosum 0.06 0.00 0.03 ns ns p < 0.05 

Baccharis teindalensis 0.13 0.00 0.00 ns ns ns 

Baccharis trinervis 0.24 0.11 0.00 ns ns ns 

N. lovata 1.05 0.68 0.36 p < 0.01 p < 0.01 p < 0.05 

Conyza floribunda 1.08 0.47 0.40 p < 0.01 p < 0.05 ns 

Urena lobata 0.84 0.84 0.36 p < 0.01 p < 0.01 p < 0.01 

Eupatorium articulatum 0.94 0.98 0.44 p < 0.01 p < 0.01 p < 0.01 

A p value less than 0.05 (p < 0.05) is considered statistically significant. 
RAA, relative anti-inflammatory activity; ns, not statistically significant 



In a different but related matter, it is important to mention that, among the many 
Ecuadorian plant-derived products already commercialized, there are some apparently 
novel products presently being heavily marketed and sold in large quantities. Some of 
them have been promoted as having protean and intense activities on various bodily 
systems, including the immune, cardiovascular, gastrointestinal, and dermatological 
systems. These agents are also not only being sold in Ecuador, but also exported to 
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Table 6 

Relative Anti-Inflammatory Activity (aa) of Ethanolic Plant Extracts 

As Compared With Phenylbutazone (aa = 1.0) 





Relative anti 


-inflammatory activity 


Statistical 


analysis (student's t test) 




RAA 


RAA 


RAA 








Plant species 


at3h 


at 5h 


at7h 


3h 


5h 


7h 


Tagetes pusilla 


1.40 


1.23 


1.44 


p<0.01 


p < 0.01 


p < 0.01 


Eupatorium glutinosum 


0.90 


0.67 


0.51 


p < 0.05 


p < 0.05 


ns 


Conyza floribunda 


1.53 


0.72 


0.66 


p<0.01 


p < 0.01 


p < 0.05 


Baccharis trinervis 


0.43 


0.65 


0.49 


ns 


p < 0.05 


ns 


N. lovata 


0.00 


0.29 


0.44 


ns 


ns 


ns 


Urena lobata 


0.23 


0.41 


0.49 


ns 


ns 


ns 


Eupatorium articulatum 


2.03 


0.84 


1.20 


p<0.01 


p < 0.05 


p < 0.01 



A p value less than 0.05 (p < 0.05) is considered statistically significant. 
RAA, relative anti-inflammatory activity; ns, not statistically significant. 

other countries as "magic" entities aimed to cure cancer, all types of rheumatic disor- 
ders, as well as other human diseases. In spite of anecdotal accounts of efficacy of 
these compounds, supposedly by mechanisms known only by their promoters, there is 
no credible evidence that supports any clear therapeutic activity. For these reasons, we 
firmly believe that these products need to undergo real scientific testing with rigid 
regulatory procedures included in the study protocols. This will not only ensure human 
health and document possible side effects, but will also save money and avoid major 
fraud to people. 

The enormous therapeutic potential derived from plants is a very promising alterna- 
tive for drug discovery, and could also help to provide a model for genuine sustainable 
development that preserves, restores, and intelligently uses natural resources. The 
implementation of such a system will not only help to create jobs, wealth, education, 
and environmental consciousness in developing nations, but will also create a model of 
true feasibility of the coexistence of environmentally protective endeavors along with 
profitable businesses. This is especially important at the present time, when the indus- 
trialized and rich nations of the world are mainly responsible for the worst pollution the 
world has ever seen. 

4. Legal Considerations 

Besides major traditional legal principles already included or adapted as integral 
parts of legislation of most countries, and besides declaratory or law-designing meet- 
ings, including the very well-known Rio de Janeiro Biodiversity Treaty, the Kyoto 
Protocol, and other environmental legal milestones, it is pertinent to briefly cite some 
legal instruments relevant to the Ecuadorian context. 

4. 1. Ecuadorian Legislation 

In 1996, because of the necessity of determining the value of wild species and tradi- 
tional knowledge in all projects included in the so-called genetic resource area, the 
Ecuadorian Congress approved the "Law that protects the biodiversity in Ecuador." 
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Similarly, in 1998, the legislation approved the legal body that addresses intellectual 
property. It is contained in "The Intellectual Property Law" issued at that time (13). 

The law that protects biodiversity was published in Official Register No. 35, dated 
September 27, 1996. This law defines biodiversity as a national good of public use and 
ratifies the contents of the Biological Diversity Agreement, issued for those countries 
with sovereign rights over biological resources. The rights of indigenous communities 
over all knowledge associated with biodiversity are also recognized in this law. Never- 
theless, and because of its brevity and the mainly declarative character of its articles, 
the law is insufficient to regulate the majority of multifarious and miscellaneous fac- 
tors related to the conservation and sustainable use of biodiversity. 

The Ecuadorian Policy on Forestry, Natural Areas, and Wildlife (23) favors the con- 
servation and sustainable use of biodiversity (38). This policy encourages the coordi- 
nation and participation of institutions involved in this area to design a strategy for the 
investigation, monitoring, evaluation, and protection of biodiversity. In this sense, this 
policy is aimed at regulating issues such as the evaluation of markets for products and 
services derived from biodiversity, the establishment of access or restriction to 
resources of biodiversity, and the creation of biodiversity investigation programs. 
It also includes the determination of procedures for economic, scientific, ecological, 
genetic, and cultural evaluation of the products and services derived from biodiversity, 
as well as the development of alternatives for the protection and management of bio- 
logical resources. 

4.2. International Legislation: The Andean Pact 

In reference to treatment of intellectual property rights in countries belonging to the 
Andean Pact, two standards were established: Decision 345 and Decision 344 (38). 
Decision 345 recognizes and guarantees the rights of the developers of new vegetable 
varieties through an achiever certificate, as follows: "Member States will confer an 
achiever certificate to those persons who have created vegetable varieties when these 
are new, homogeneous, distinguishable and stable, and have been assigned a denomi- 
nation that constitutes its generic designation. It should be understood that the term 
'create' refers to the achievement of a new variety through the application of scientific 
knowledge or the hereditary enhancement of the parts." 

Decision 344 establishes patents and states the following: "Member States will con- 
fer patents for inventions which are products or procedures in every technological field. 
Products and procedures should be new, have an inventive level and be susceptible of 
industrial application. An invention is new when it is not comprehended in the techni- 
cal statement, meaning that it has not been accessible to the public through an oral or 
written description, through its use or in any other way before the date of patent solici- 
tation, in the case of recognized priority." 

5. Conclusions 

Considering that Ecuador possesses some of the greatest biological diversity in the 
world despite consisting of less than 0.2% of the planet's surface, an investment in 
developing its natural resources seems appropriate. This small country has the highest 
biodiversity per hectare in all of South America, and harbors approx 25,000 vegetal 
species that encompass enormous therapeutic potential. 
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In this context, the feasibility of implementing research studies locally and at a very 
reasonable cost is one of the most attractive features for developing partnerships among 
entities from both the developed and developing countries, including Ecuador. These 
study protocols, aimed at testing for safety and efficacy of the plant extracts, are the 
cornerstone of modern and rational drug development. The latest introduction of appro- 
priate review boards that meet international standards, guarantees the enforcement of the 
most important aspect of medical investigation in the world: the ethical compliance by 
both local and international partners. 

The feasibility of locally implementing sound scientific and well-controlled studies 
also alleviates the heavy economic burden that companies belonging to the developed 
world have to bear. As a matter of fact, there are many institutions and young scientists 
in South America and Ecuador that are waiting their opportunity to collaborate in stud- 
ies not only of plant extracts, but of pharmaceuticals in general. In this sense, with rigid 
regulation and extensive monitoring, two low-cost procedures that can be easily imple- 
mented at these locations, this strategy will prove to be a cost-saving maneuver, since 
studies can be done at a fraction of their cost, if compared to the cost of their perfor- 
mance in the developed world; nonetheless, the quality of their results can be preserved 
and guaranteed. 

These simple facts and considerations should lead environmental scientists, econo- 
mists, researchers, and investors to consider Ecuador as one of the most rewarding 
places for plant natural-product research in the present and in the future. 
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